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ABSTRACT: 

PURPOSETo obtain a gene of a newly elucidated hepatitis C virus (HCV), a specific 
oligonucleotide, a method for detection thereof and a method for determining the genotype of the 
HCV. 

CONSTITUTIONThe polynucleotide has a base sequence described in sequence Nos. 1 to 5. 
The oligonucleotide is described in sequence No 6. The methods for detecting an HCV gene and 
determining the genotype use the polynucleotide and oligonucleotide as a primer, a probe, etc. 
Furthermore, the polypeptide is described in sequence Nos. 15 to 19. Thereby, the HCV gene of 
the newly found genotype can be detected and the HCV genotype Can simultaneously be 
determined over a wide range. 
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1 

NA • HC-G9. 

«ffi»&8gEfl£*r*#y;3?l':*? 1 FYS 1 1 7 

• 5' . 

[1W*«3] E*J#^3iS»©£»K?i|. Zfzttznt 
ffi««^S[SE5(l*Wf S^'JJC^P'^HYS 1 1 7 

• 3' . 

**mft«aiEMs«r**u x * f s r o 3 7 

• 5' . 

«««tt«aiBJii*«r *# U S * U:*? F S R 0 3 7 

• 3' . 

F S&tt* U =f5» ? u^-g 1 F. 

mxm 7 ] U3&£?g 6 mmw* u =r* * f *» 

[11*918] EMS^6E*ffllgSE5iJ!&^rr5*U=f 
^^W*?F#3 2 1. 

[«f 9 ] 6 BKOttSBH&rr S * U =f 

[BjffcEl 0] E^S^6^it©SSE5>J^-rs*'J 
ir U*? Fy^-f T-tE50«^-7 9 IEK© 

mMWM^-t^n 3*9u*<? Yzr=7-i -7-© ia 
[»#3U i] E5ns^6ia«0jgSE?iJs^rrs*u 

=fX ? I/:*? F^-f v-tGffl&n 1 0 1 4fB 
«©*SE?IIS*-r-5*U =fjc * F79-T V-© 

ft. 

[^fl 3] ^«m7«a^u^i 0JglBe©^U 

=f st ^ WW v-irzmm-rz caff**-* 
^x©&e^&aj&. 

[§f#Bl4] tf?Xi*>U7^->3>ffilC 
<fcOcDNA£iii|I^5£££#f8£r5SI*lSgl 2 

mi 3JS!E«©Cl!Jff& , M^X&£^&im 

nt&si 5] n^mi msfctejui 2m§zm<o* 
u^^^i/^F^^-fv-saaaffifflu. ffiaisi© 
«<@sffp»*^i 3Jiis*©cs!iff^'>'fjuxae 
* fctassfcttig 1 4JS82»©csffF^'r;i'xae? 
amm 



(2) #@B¥6-3 1 9 5 6 3 

2 

mxmi 7] 6JBg«©g§»^D-y$ffi 

[St3fcll 8] E50S^1 5S*©# l J'V7*'f FHC 
— G 9 P e o t e in. fc&tffc-ewfiga^T'^'f F. 

9] E*is*ti emmotfv fys 

117-5' Peptide. fcStffc-?-©^^:/:? 
-f F. 

m#m2 0] E*>J#*f 1 7§B*©#'J'*:7;5'-f FYS 
JO 117-3' Peptide. ftetflC^Wgg^^ 
■f F« 

[»#S 2 1 ] E*»J#^ 1 8 a«©# >J -r F S R 
037-5' Peptide. &6OTC-€-©gS»^^ 
-f F. 

2 2 ] E?J#^ 1 9 S«©# m~f9-i FSR 
037-3' Peptide. ft^^lC-€-©8B^^ 
-f F. 

[^^©Pffll/il89!] 

[0 0 0 1] 

20 img±<D$m#m *%mz. cmst&v-i ;ux (a 

T THCVj tBSiETS) WcDNAlgf, ^tl^fll 

js-rsfts^'j^x^i'^F. cmcaurr-sse 
s. ^^-f f. ft^t^iccnesffl^fcHcvjae^ 

[0 0 02] 

[ae*©s«] 1 9 8 8mzHcvm^<o-mmm 

©s^ra^^n^ftisnT^fc. ^ns?. hcv 
T&%&.m* izmt>nz>H c vst#^tb-r 

E^J* S l» H#II6iS©#J£ £ *ntcs^ < #»fft 
fli©Sl5£*iil*5-C*5. £gSIC. tJ<-z>^©HCV«C 

t>v>TnaeT©±E?iw^$nTi3 o. $ sjcffi© 

<W #fC^Tttffl£^E?J©-S8**#9i2n. HCV1$S 
ae^©E^i©#^»S^«HC V#a75 /KE?J© 
Itecsifflanft. ^©te*. flas©&£ffii;:it-<. c 

nS©ffif?ICS^>Tli«3nfcSiE©^»ffettKV^ 

flfi^. C*lSO«*at*ffll»fc«d-Cb. ft*3M« 
•C*ftV»HCV«ai«|*t**i:tfc«WLT*0. <t0 

-5. 

50 [0 0 0 3] 
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(3) 

3 

{%Wi)m& b <k o tt 5KS] H C VttiE-oite^E 

$nfcHcv«ife$^>at3&o. Hcv©jae^tt* 
n-ettusTeftViHcvflwte^wtiMPWU *© 

©B»tt, ^BST©HCV*^jrCfii+#{Cftt&#fr 

^n*^ h c v«c©-a^fiaai*^tc t>5 1 1 1) ic 

b. *©*firF»ttSfWrr*CfcfcJ:D. jEWjSoJE 
[0 004] 

COSiHCV ©«^TH+«-IC^a#tt 5 tlfc ^ H C V 
RNAS**U Ctt*JHttT*»©HCV*©£«e 

^ew**«t&. 3&c*a9i4rsfiK. d©gf«©a 
«ffle*i t^3fe«# s n&sae^ §!©&»© h c va 
£TE?j££Jt«$b&i&&, «woHcv*)ifi»«> 
a^s©^-rn»cfflsi--5 i &©T^:<. ±<me>8ii, 

*58 W# 6 fit 1 c m&te?M IC^S«Tffi©aeTS! 
Kli?5Fftl/ft^»fi : ffiJII«ltt£l/£. ^©EJaJfc^ 30 

l/rWBTSCifciD 1 c£©S^&roe0*?JI6£ 
fcofc. S&fc4H0©»eTffl*#*fc£T©«eTffl 

5*891©* U =f 3t * l'*? F £75K T-t LTltJffl 

■fsiiticiD t— «©«OETae^-s©fd3£***ST' 

SSHtSJWHU #»9JfcSB«L&. *3891©# , J* 

£T5£<iK£DfP/8£?6ffT5 ; &©-e*5. #'J*dr 
Hft StflC^-'J =T* 9 F©«^fcfiiEJi|± 

^©©gsa**? -c n ci t\mm<D c 

fTN&5*>S. ^^©^'Jpt^U^KifcfS^-'J^ 
H 1t *> Wc* <) rf 7. 9 H « 3&*89!©«HIC 
[0 0 0 5] -TfcfcS, *«Wtta»©HCVittlia: 

5SfLv»ae^ffl-p»5 1 caswrsHcvftsHc 50 



«Pffl¥6-3 1 9 5 6 3 

-G9©cDNAae^©^wr»o, £;t-?-©#g« 

a^E3*J©-g&£«i£T5. ft*»l:t;:neffifflQ&: 

^HC«-r*»WTr*0. JWWW:l4EJHH*lfcH 
b 5 ffi«©«SEJU, SfcfiiilttlcfittnftttSBM© 
5 V» U * * U^? 1 F <&5 Hfifc* 

iJ^^V^F©55M-C*-5. *89Jtt. _hlH^-U =f 

g§a^nfc^n-y©SWT*0, *ftE5fJS#6i3« 

©•sssBMffl^aas t»tt-»s*rr -a -7-* 5 

-fv-fc. E50#^7^v^Ll 4£«©a&f : E^€W 
T5 * 'J ^ F #5&5 T- 1 
fc-e-TfJfflL. HCVafS^©tttB. afe?!!©^^ 

at. *«^ttHWrc«ffl-r-scttj:oTHcv©ae 
[0006] ^CTiiz^r-t^ic:. se 

#■«©«*© fc h«*J:0Wft©*ttT?RNA««ta 
b. HC- G 9 {CPHTfit^Woafi^fiJIIft^fil/. 
RHCV*I-IC-G9i#*U a0©2«#{C3HT 
«*©-«©BJS«4»)6L. K«tf*S 117, SR0 
3 7 -e©«SE?!lttE^J#^ 1 ft W L 5 SB 

S©t*t)-C*^. HC-G9©ikaefE5iJtt3' 
fflUlC^anfcTX f-U-y^g8»&^^T9 4 4 0P©Jg 
Sj8»6j£D. 5' «1C3 4 ljttfi)S>6A:«^flR»K 
£. iK^T9 0 3 3fiS«»6aO 3 0 1 175/KS3 
-HfS®«*t, jEt^JiniCg8< 3' «lc 6 6 £8*6 

[0 0 0 7] *58M#tt. Mt«f2K:«1-J:9ic*nn 

©»«T*s±ias«c©is»E5'jtaa©4^©a&? 
mizmtz 1 4«©«isE?jt^*«Lfc. ^©g?«. 

±B»ffl«HHEJO t iE»©aCT-aE W t ©Bfc (42 0 

%BLkfflHMB|igfe6<<&D, *5S«©ae?i5«fc^3p , J5i 

•HB©fc©"P**CiatJlW*n*:. s&c, **w*r 
6 fit. *»W©«ef t *±WJ5ti'U*?Hfcr3V> 
T. -«©ae-?E5>J©**»^T»«HCV«fc»L 

Tftae^Bwottneffofc. *©*s*. 

ri>sa£Ti5«fctf#'J:2^^h'l3. ±IH— S5EJIJ© 
W L T V> -5 H C V i: fe&JST* 3 L TtafeTS 

{C^-SSr^WHCVWae^S^S^WICl clt# 
=Sbfc. 

[0 0 0 8] nisw2ic*-r«fc5ici C 

ishc v«c©ae^w«icsa;fta€ ; f®«<i: i^tn 
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(4) 

5 

[0 0 0 9] HCV©iSfc?&ffl:fc,fctf 

7?--f"r-0tt**64Mftl,&. -ton*. EJH#^7 
£«©&&©* U rf* ? H ©ffi?Utt l c Sic &«fc 

£WM.m V tz. SiM&n 6 S2«© 1 c U ztx 

C0 0 10] **WW33ifirP©lWBertiti:LT#U^ 

7— t?. iJ7?->3>S (PCRS) £8FjI 

Ttt. E*tJ#*f 6 IHS©^" >J =f * i/ H £ffii«#Ht 
[0 0 11] PCRSCftV^TB, Sg-aP&tLT&fi 

ttffliae^sicfli^asn-s. £©«is8©pcRa; 

fcWB-r 'J :** * I*-*? 1 F^7- i Hit 

S-gPg/B i bTS^J#^ 7 ffi*©:* U 2 9 1/*? H 

UT82?>J#^ 6 S«©^- U rf 5C ^ H £WM&^ 9 

[0012] *^w©*ttcj;ntf*^©ifa so 

*V1*>7V*=?Y- (E^#^6) £ffi2»l#^7B«fc 
V> LgS?*J#^ 1 4 ©&Kftia£^S!#JIWtf»J rf* 

-i-v-t&mmiztiffih. pcRj**s^ta»ffl©2S 

PgPCR£®fllLT. -S©^#CJ:DHCVae^ 
[0 0 13] *5!^^tt^Sl©S±lxa*^ 

[0 0 1 4] *%m<DtfV'<7f"( 40 

t37«*SIC*ViTS!^ffll^tt*^rr5*^ X.y<U- 
^ (El) *<tt;E2/NS lT[Kroffl§M<. 
m®mft)-?&ZZ:tZ7f;VTUZ>. El. E2/NS1 

•5. Lfc#oT. #8W©#U^:/^ Ffc<fctf*©iS 
»*U^^-f Fttl c^S«IlCfn:x>^D-^Jii 
#©lftm*©^J5^ HCV7^5 1 >^ffl-r5^t*» 

£fcNS 2~NS 5«««:/n^7--tf$ffl# 50 



#B8¥6-3 1 9 5 6 3 

teVUmmm-PH C VitJgffl^9©BBfglCfflViS C 
[0 0 15] 

lcICHCVg^ft, ?£#±9&jEift&»e^fi©H 
CV*iSSSlCttUIL, *fc^©HCV©^jieTS!t 
I^I^CWSWCWJg-rsct^T^S. *589J©jS£ 

- Mas. 'S^-f F«CVit&?©&tilfc shiest 

[0 0 16] 

[*J6«] «T. *%W©^Jfi«t^^T©^^-5*t. 
t) <i:«fc D*^36«wne©*SSW{C®^Sn5 fcOTftt 

[0 0 17] ^JS«aji 

sas©®^ m\zttm2nt3;^m%i<DmmH c vw&m 

fflb. ^©^:fiSS?!l*5J;Uf-8S©«SSa5iJ2r*©«fc^ 
[0 0 18] (1) RNAffliflffi 

t&ir©h c vvam&m& <fcrx*5sw« s c j: o tbb?6 

Sn«rlffflS'Pffl^«3iE}* (»B?2- 1 5 3 4 0 

i) ^w^^ici-DTgii^iffHgan^ 

^■•J ^51:? V*? K7"7-f ^-*fflVi&HC V^ffl* 
(#ffl¥5-2 3 2 0 0) CiOHCV&Sfc&SKKSn 

VSg^S!W3£llH-r5^j£ (»S-¥3 - 3 0 7 2 9 

6. 4-093960) "ei±^©itfiTS!W^-e^/a: 
^•p&IF^S^t&^WjfiliK^ (HC-G9. YS11 

7. *«tr;SR0 3 7) ^5*©«fc5ICbTRNA*}a 
fflUfc. Jfcf»5 0^ 1 CS^fi© h U Xigtr* (1 0m 
M. pH8. 0) SJD^.. 9 0 X 1 0 3 r pmfCTl 5 
»K©S'L^l8*ffofco n<bflfz^v h\Z2 0 0m 
MONaCl. 1 OmMEDTA. 2% (MmSftW 
©Hx^HSK^-h'J^A (SDS) tlmg/ml© 
•fU^-y— tfKS-^tf hUXSSfffi (5 0mM. pH 

8. 0) Sin*.. 6 0-CT?l BffSJSDHU 7x/-JI// 

V5RNA*a-fe. 

[0 0 19] (2) c DNA©f£SB 
#^J:0^fcRNA&7 0^-ci»raJn®l/fcg. Z. 

n*S»L^S!RNA<J:Lfc„ C©SFSRNA-^>^ 

tC 1 0 On- y h<D®lfcimM (Superscrip 

t;GIBCO. BRL) £<fctfcf-'J 

7-fV-2 0 pmo 1 ^JUA. 4 2"C. milHSiS^-a- 

TcD.NASffe. 

[0 0 2 0] (3) cDNAO#iJ^7- tf^x-OU 
7^->a> (PCR) C<t^li<S 
±S©^ICi:0#enfcmScDNAlC^>^T. 01 
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(5) #BS¥6-3 1 9 56 3 

7 8 
\Z7iiTffi$$\\Z. & 1 \Z7fi-t&t%i&mzmfeLrzil>Z *S [Science, Vol. 239. p487-49 
ffl^-U^^W^?-H^ , 7-f'7-*&WC7>^-fe>X 1 (1 9 8 8) ] CSEoT3 SEJoaKB-tK ^l^fcfc 
®MrV =t* V V*3- F^-f ?>1tZ> 7=74 V-^ -5 P C Rj£lCT*JSLfc. 

T-^mxmm^n-Dt^ mmt. dnaij— vjhk co o 2 1] 

^=7— (Perkin-Elmer. Cetus) l:G [01] 
ene Amp DNAJtHtSaS+y F (P e r k i n [0 0 2 2] 

-E lme r • Ce t us) £fflWTS a i k 1 [^1] 

E#f## : 1 

E5f^§ : 9 4 S 7 

&m. : 

E^SS : c DNA to genomic RNA (HC-69 CONA) 

GCCAGCCCCC 7GA7G6GCGC GACACTCCGC CATGAATCAC TCCCCTGT6A GGAACTACTG 60 
TCrTCACGCA GAAAGCGTCT AGCCAT6GCG TTAGTATGAG TGTCGTGCAG CCTCCAGGAC 120 
CCCCCCTCCC G66AGAGCCA TAGTGG7CTG CGGAACCGGT GAGTACACCG GAATTGCCAG 180 
GACGACCGGG TCCTTTCTTG GATTAACCCG CTCAATGCCT GGAGA7T7GG GCGTGCCCCC 240 
GCAAGACTGC TAGCCGAGTA GTGTTGGGTC GCGAAAGGCC TTGTGGTACT GCCTGATAGG 300 
G7GCTTGCGA GTGCCCCGGG AGGTCTCGTA GACCGTGCAC CATGAGCACG AATCCTAAAC 360 
C7CAAAGAAA AACCAAACGT AACACCAACC GCCGCCCACA GGACGTTAAG TTCCCGGGTG 420 
GCGGCCAGAT CG77GGCGGA GTTTACTTGT TGCCGCGCAG GGGCCCCAGA GTGGGTGTGC 480 
GCGCGACGAG GAAGACTTCC GAGCGG7CGC AACCTCGCGG GAGGCGTCAG CCTATTCCCA 540 
AGGCCCGCCG ACCCGAGGGA AGGTCCTGGG CGCAGCCCGG GTACCCTFGG CCCCTCTATG 600 
GCAACGAGGG CTGTGGGTGG GCGGGATGGC 7CC7G7CCCC CCGCGGCTCT CGGCCTAGTT 660 
GGGGCCCTTC TGACCCCCGG CGGAGGTCAC GCAATTTGGG TAAGGTCATC GATACCCTCA 720 
CGTGTGGCTT CGCCGACCTC ATGGGGTACA TCCCGCTCGT CGGCGCTCCT CTAGGGGGCG 780 
CTGCCAGAGC TCT6GCACAT GGTGTTAGAG TCCTGGAAGA CGGCGTGAAT TACGCAACAG 840 
6GAACC7CCC CGGTTGCTCT TTTTCTATCT TCTTGCTCGC TCTTCTATCC TGCCTGACAG 900 
TCCCTGCTTC GGCCGTCGGA GTGCGCAAC7 CTTCGGGGGT GTACCATGTC ACCAATGATT 960 
GCCCCAATGC GTCCGTTGTG TACGAGACGG AGAACCTGAT CATGCATCTG CCCGGGTGTG 1020 
1GCCCTACGT ACGCGAGGGC AACGCCTCGA GGTGTTGGGT CTCCCI7AGT CCCACCGTAG 1080 
CCGCCAGGGA TTCGCGCGTC CCCGTCAGTG AGGTTCGGCG TCGTGTCGAC TCGATTGTCG 1140 
GGGCCGCTGC GTTC7GTTCG GC7A7G7A7G TAGGGGACCT ATGCGGC7CC ATCTTCCITG 1200 
77GGCCAGA7 CT7CACC77C 7C7CCCAGGC ACCA77GGAC GACGCAAGAC rGCAA7fGC7 1260 
CCA7C7ACCC AGGCCATG7G ACAGG7CA7C GAATGGC77G G6ACA7GA7C ATGAA7TGG7 1320 
[0 0 2 3] (4) cDNA7'f^7'J-ffliit:i:5 ■JJ*?U:*?F:*M'*-X (New England 
HC-G9, YS 1 1 7, i5«fctfSR0 3 7®SSffi?>J Blolabs) , T4 DNA#'J*7— fef (T a k 
©ifcg ara Biochemicals) M 1 3 

P CRiZTmmLfc&tkfcfii&n&imm&rF&T 4# 50 1 7-V^>? *>-\zm AU ?U->i£Vtc. SSSE^J 
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9 

ftSIld i deoxy chain terminat 
I o n ffitCT, S equencenase sequ 
encekit ver 2. 0 (United Sr 
are B i o c h emn i c a 1 s) fe-SWiAu t 
oRead Sequencing kit (Pha 
rmac i a) fcffl^TfTofc. SttfrlCO^T, 

«3^ -en-^nic^^TE^ft^b, & 

KmLrmmztivzisit. Hc-G9i»vvrii£mt 

S» YSll 7^Wt:SR0 3 7iC3HTtt5' 

o 6 3#a<oa»^^ 1 8 6 7mm<Dmm&-c<Dmmt 

8 2 5 9#B^69 4 4 0#BS7?©E5«tC^^Tft^ 
bfc. E?5^1tCHC-G9, K#J*2fCYS 1 1 7, 
©5' m. S^3l:YS 117, <D3' ft E^I*4 
lCSR037<Z)5'ft EM^SKSRO 3 7<0ifigE 
#I<D3' <M<D&&£E^£iKTo S21CHC-G9, Y 
S I 1 7, S R 0 3 7 (DEMrawffl^te^-To ^ife 
Ctl^3^«:CD«SE3«I<Dffl^tt«9 5%*D> ± 



(6) ftBH¥6-3 1 9 5 6 3 

10 

s^3^<DHcv«^-^ae^a»c»s$n^ 0 @ 
2 ch c - g 9 tzn*T\z±®<Dm&mmmwi<n 

oiie^ h c - g 9 «^-rn<ogEacojafi^«ccD*as 

<fco. c©sffcftMv^$nfcise?s«^< ist 
nf«n^sicjSL. *v>ftmiz\*-rv\zim. 113? 

ct^e. cntcffltoxi cuts^wtc^^ttL 

[0 0 24] 
[02] 
[0025] 
»2] 
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(7) ft0B¥6-3 1 9 5 6 3 

11 12 
CACCCACTGG CGCCTTAGT6 GTGGCACAGC TACTCCGGAT CCCACAA6CT ATCGTGGATA 1380 
TGATAGCTGG TGCCCACTGG GGTGTCCTAG CGGGCCTGGC ATACTACTCC ATGGTG6GGA 1440 
ACTGGGCTAA GGTTGTGGTC GTGCTGCTGC TCTTCGCTGG CGTTGAC6CA GAGACCCGGG 1500 
TCACAGGGGG GGCCGCTGGC CACACCGCGT TCGGGTTTGC TAGCTTCCTC GCCCCAGGCG 1560 
CTAAGCAAAA GATCCAGCTC ATAAATACCA ACGGCA6CTG GCACATCAAC AGAACTGCCT 1620 
TGAACTGTAA TGAAAGCrTG GATACTGGCT GGCTAGCAGG GCTGCTCTAC TACCACAAGT 1680 
TCAACTCCTC AG6GTGTCCC GAGAGGATGG CTAGTTGCCA ACCTCTTACC GCCTTCGACC 1740 
AAGGGTGG6G ACCCATCACT CACGAGGGGA ATGCTAGTGA TGACCAGCGG CCATATTGTT 1800 
G6CACTATGC CCTACGCCCG TGTGGCATTG TGCCAGCGAA AAAGGTT7GC GGGCCTGTAT 1860 
ACTGTTTCAC ACCCAGCCCC GTGGTAGTGG GGACGACGGA CAGAGCCCGC GTTCCTACCT 1920 
ACAGATGGGG TGCCAATGAG ACGGATGTAC TGCTCCTCAA CAACTCTAG6 CCGCCAATGG 1930 
G6AATTGGTT TGGGTGTACG TGGATGAATT CTAGTGGCTT CACCAAGACG TGCGGGGCTC 2040 
CG6CCTGCAA CATCGGCGGG AGCGGGAACA ATACCCTGCT GTGCCCAACA GATTGCTTCC 2100 
GTAAACATCC GGATGCCACA TACAGCAGG7 GCGGCTCTGG TCCCTGGCTT ACCCCTCGAT 2160 
GCTTGGTAGA CTACCCATAC AGGCTCTGGC ACTACCCCTG TACAGTCAAT TACACCATTT 2220 
TCAAGATCAG GATGTTTGTG GGCGGGGTTG AGCACAGGCT TGACGCCGCG TGCAACTGGA 2280 
CGCGGGGAGA GCGCTGCGAT TTGGACGACA GGGATCGGGC CGAGTTGAGC CCTCTGTTGC 2340 
TGTCCACTAC GCAATGGCAG GTCCTCCCCT GCTCATTCAC AACACTGCCC GCCCTGTCAA 2400 
CTGGCCTGAT ACATCTCCAC CAGAACATCG TGGACG7GCA GTACCTCTAT GGGTTGAGCT 2460 
CGGCAGTCAC ATCCTGGGTC ATAAAGTGGG AGTACGTTGT GCTCCTCTTC TTGCTGCTAG 2520 
CAGAFGCTCG CAITTG7GCC TGCTTGTGGA TGATGCTTCT CATATCTCAG GTAGAflflCGG 2580 
CGCTGGAGAA CTTGATAGTT CTCAACGCTG CTTCCCTAGT CGGGACACAT GGCATCG1CC 2640 
CCTTCTTCAT CTITTTTTGT GCAGCTTGGT ACCFAAAAGG CAAGTGGGCC CCTGGACTCG 2700 
CCTAITCCGT CTATGGGATG TGGCCACTGC TCCTGCTTCT CCTGGCGTTG CCCCAACGGG 2760 
CATACGCCTT GGATCAGGAG ITGGCCSCGT CGTGTGGGGC CACGGTCTTC ATCTGCCTAG 2820 
CGGTGCTCAC TCTATCGCCA TATTACAAAC AGTACATGGC CCGCGGCATC TGGTGGCTGC 2880 
AGTACAIGCr GACCAQAGCA GAGGCGCTCC TACAGGTTTG GGTCCCCCCG CTCAACGCCC 2940 
GAGGAGGGCG CGACGGAGTC GTACTGCICA CGTGTGTGCT CCACCCGCAC TTGCTCTTTG 3000 
AAATCACCAA GATCATGCTG GCCATTCTCG GGCCTTTGTG GATCTTGCAG 6CCAGTCTGC 3060 
TCAAGGTACC GTACTTCGTG CGTGCCCACG GTCTCATTAG GCTCTGCATG CTGGTGCGCA 3120 
AGACAGCGGG CGGTCAGTAT GTGCAGATGG CTCTGTTAAA GCTGGGAGCA TTTGCCGGCA 3180 

[0026] sasm 40 sw^ae^sirn-cDiistcsL&E^jsw-ri^-u^ 
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15 16 
CCTACATTTA CAACCACCTT TCCCCGCTCC AAGACTGGGC TCACAGCGGT CTGCGCGACC 3240 
TGGCGGTAGC CACTGAACCC GTCATCTTCT CCCGGATGGA AATCAAGACT ATCACCTGGG 33M 
GGGCGGATAC TGCGGCTTGT GGAGACATCA TCAACGGGCT GCCTGTTTCC GCCCGGAGAG 3360 
GGAGAGA6GT GTTGCrGGGA CCAGCCGATG CCCTGACTGA CAAAGGATGG AGGCTTTTAG 3420 
CCCCCATCAC GGCTTACGCC CA6CAGACAC GGGGTCTCTT G6GCTGCATC ATCACCAGCC 3480 
TCACCGGTCG GGACAAAAAT CAAGTGGAGG GGGAAGTCCA GATTGTGTCT ACCGCAACCC 3540 
AGACGTTCTT GGCTACTT6T GTTAATGGAG TTTGCT6GAC TGTGTATCAT GGGGCCGGAT 3600 
CGAGGACCAT CGCTTCGGCG TCGGGCCCTG TGATCCAGAT GTACACTAAT GTGGACCAGG 3660 
ATTTG6TGG6 CTGGCCAGCG CCTCAGGGAG CGCGCTCCCT GACGCCGTGC ACATGCGGCG 3720 
CCTCGGATCT GTACTT6GTC ACGAGGCACG CGGACGTCAF TCCAGTGCGG CGTCGGGGCG 3780 
ATAACAGGGG AAGTTTACTA TCTCCCCQGC CAATTTCATA TCTAAAGGGA TCCTCGCGAG 3040 
GCCCCCTGCT CTGTCCCATG QGACATGCCG TGGGCATTTT CAGGGCCGCG GTGTGCACCC 3900 
GTG6GGTCGC AAAGGCGGTC GACTTTGTGC CCGTTGAATC CCTAGAGACC ACCATGAGGT 3960 
CCCCAGrGTT TACCGACAAT TCCAGCCCTC CGACAGTGCC CCAGAGCTAC CAGGTGGCGC 4020 
ATCTGCACGC TCCCACTGGA AGTGGTAAGA GCACGAAGGT GCCGGCCGCC TAfGCGGCTC 4080 
AAGGGTACAA GGTTCTTGTG CTGAACCC6T CTGTTGCTGC CACCCTAGGG TTCGGCGCTT 4140 
ATATGTCAAA GGCCCATGGG ATTGACCCAA ACGTCAGGAC TGGCGTAAGG ACCATTACCA 4200 
CAGGCTCCCC CATCACCCAC TCCACCTACG GCAAATTCCT GGCTGACGGT GGGTGTTCAG 4260 
GAGGTGCGTA TGACATCATA ATATG7GACG AATGTCACTC AGTGGACGCC ACCTCGATTC 4320 
TAGGCATAGG GACT6TCTTG GACCAAGCGG AGACAGCGGG GGTTAGGCTC ACTATCCTCG 4380 
CCACCGCTAC ACCACCfGGC TCCGTCACCG TGCCACATTC CAACATCGAG GAAGTTGCAT 4440 
TGTCCACTGA GGGGGAGATA CCAI7CTATG GTAAG6CCAT CCCCCTAAAT TACATCAAGG 4500 
GGGGGAGGCA TCTCATFTTC TGTCATTCCA AGAAGAAGTG CGACGAGCTC GCTGCAAAGC 4560 
TGGTTG6CCT GGGCGTCAAC GCAGTGGCCT TTTACCGCGG CCTCGACGTG TCTGTCATCC 4620 
CAACCACAGG AGACGTCGI7 GTTGTGGCGA CCGACGCCTf AATGACTGGC fACACCGGCG 4680 
ATTTCGACTC CGfGATAGAC TGCAACACCT GTGTCGTCCA GACAGTCGAT TTCAGCCTAG 4740 
ACCCTACATT CTCTATTGAG ACTTCCACCG TGCCCCAGGA CGCCGrGTCC CGCTCCCAAC 4800 
GGAGAGGTAG AACCGGTCGG GGGAAGCATG GTATCTACAG ATATGTGTCA CCCGGGGAGC 4860 
GGCCGTCTGG CATGTTTGAC TCCGTGGTCC TCTGTGAGTG CTATGACGCG GGTTGTGCTT 4920 
GGIATGAGCT TACACCCGCC GAGACCACGG TTAGGTTACG 6GCATATC7T AACACCCCAG 4980 
GGTTGCCCGT GTGCCAGGAC CACTTGGAGT TTTGGGAGAG CGTCTTCACC GGCCTCACCC 5040 

0031] 40 mmzmm-zaz. *ftwiz&omfz\zm%.2nfz* 
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17 18 

WMtt : 1 
WM<F>&$ : 9 4 8 7 

: -** 

fi^SS: cDNA to genomic RNA (HC-G9 CDHA) 

GCCAGCCCCC TGATGGGGGC GACACTCCGC CATGAATCAC TCCCCTGTGA GGAACTACTG 60 
TCfTCACGCA GAAAGCGTCT AGCCATGGCG TTAGTATGAG TGTCGTGCAG CCTCCAGGAC 120 
CCCCCCTCCC GGGAGAGCCA TAGTGGTCTG CGGAACCGGT GAGTACACCG GAATTGCCAG 180 
GACGACCGGG TCCTTTCTTG GATTAACCCG CTCAATGCCT GGAGA1TTGG GCGTGCCCCC 240 
GCAAGACTGC TAQCCGAGTA GTGTTGGGTC GCGAAAGGCC TTGTGGTACT GCCTGATAGG 300 
GTGCTTGCGA GTGCCCCGGG AGGTCTCGTA GACCGTGCAC CATGAGCACG AATCCTAAAC 360 
CTCAAAGAAA AACCAAACGT AACACCAACC GCCGCCCACA GGACG7TAAG TTCCCGGGTG 420 
GCGGCCAGAT CGTTGGCGGA GTTTACTTGT TGCCGCGCAG GGGCCCCAGA GTGGGTGTGC HQ 
GCGCGACGAG GAAGACTTCC GAGCGGTCGC AACCTCGCGG GAGGCGTCAG CCTATTCCCA 540 
AGGCCCGCCG ACCCGAGGGA AGGTCCTGGG CGCAGCCCGG GTACCCTrGG CCCCTCTATG 600 
GCAACGAGGG CTGTGGGTGG GCGGGATGGC TCCTGTCCCC CCGCGGCTCT CGGCCTAGTT 660 
GGGGCCCTTC TGACCCCCGG CGGAGGTCAC 6CAATTTGGG TAAGGTCATC GATACCCTCA 720 
CGTGTGGCTT CGCCGACCTC ATGGGGTACA TCCCGCTCGT CGGCGCTCCT CTAGGGGGCG 780 
CTGCCAGAGC TCTGGCACAT GGTGTTAGAG TCCTGGAAGA CGGCGTGAAf TACGCAACAG «40 
GGAACCTCCC CGGTfGCTCT TTTTCTATCT TCT7GCTCGC TCTTCTATCC TGCCTGACAG 900 
TCCCTGCTTC GGCCGTCGGA GTGCGCAACT CTTCGGGGGT GTACCATGTC ACCAATGATT 960 
GCCCCAATGC GTCCGI7GTG TACGAGACGG AGAACCTGAT CATGCATCTG CCCGGGTGTG 1020 
7GCCCTACGT ACGCGAGGGC AACGCCTCGA GGTGTTGGGT CTCCCI7AGT CCCACCGrAG 1080 
CCGCCAGGGA TTCGCGCGTC CCCGTCAGTG AGGT7CGGCG TCGTGTCGAC TCGATTGTCG 1 140 
GGGCCGCTGC GTTC7GTTCG GCTATGTATG TAGGGGACCT ATGCGGCTCC ATCTTCCTTG 1200 
TTGGCCAGAT CTTCACCTTC TCTCCCAGGC ACCATTGGAC GACGCAAGAC FGCAATTGCT 1260 
CCATCTACCC AGGCCATGTG ACAGGTCATC GAATGGCTTG G6ACATGATC ATGAATTGGT 1320 



—798- 



(11) if$ffl¥6-3 1 9 5 6 3 

19 20 
CACCCACTGG CGCCTTAGT6 GTGGCACAGC 7AC7CCGGA7 CCCACAAGCT ATCGTGGATA 1350 
7GA7AGC7GG TGCCCACTGG GGTGTCCTAG CGGGCCTGGC ATACTACTCC ATGGTG6GGA 1440 
ACTGGGCTAA GGTTGTGGTC GTGCTGCT6C TCTTCGCTGG CGTTGACGCA GAGACCCGGG 1500 
7CACAGGGGG GGCCGCTGGC CACACCGCGT TCGGGTTTGC TAGCTTCCTC GCCCCAGGCG 1560 
CTAAGCAAAA GATCCAGCTC ATAAATACCA ACGGCAGCTG GCACATCAAC AGAACTGCCT 1620 
TGAACTGTAA TGAAAGC77G GATACTGGCT GGC7AGCAGG GC76C7C7AC TACCACAAGT 1680 
TCAACTCCTC AGGGTGTCCC GAGAGGATGG CfAGTTGCCA ACCTCnACC GCCTTCGACC 1740 
AAGGGTGGGG ACCCATCACT CACGAGGGGA ATGCTAGTGA TGACCAGCGG CCATATTGTT 1500 
G6CACTATGC CCTACGCCCG TGTGGCATTG 7GCCAGCGAA AAAGGTTTGC GGGCCTGTAT 1S60 
ACTGTTTCAC ACCCAGCCCC GTGGTAGTGG GGACGACGGA CA6AGCCGGC GTTCCTACCT 1920 
ACAGA7GGGG TGCCAATGAG ACGGATGTAC TGCTCCTCAA CAACTCTAGG CCGCCAATGG 1980 
GGAATTGGTT TGGGTGTACG TGGATGAATT CTAGTGGCTT CACCAAGACG TGCGGGGCTC 2040 
CGGCCTGCAA CATCGGC6GG AGCGGGAACA ATACCCTGCT GTGCCCAACA 6ATTGC7TCC 2100 
GTAAACATCC GGATGCCACA TACAGCAGGT GCGGCTCTGG TCCCTGGCTT ACCCCTCGAT 2160 
GCTTGGTAGA CTACCCATAC AGGCTCTGGC ACTACCCCTG TACAGTCAAT TACACCATTT 2220 
TCAAGATCAG GATGTTTGTG GGCGGGGTTG AGCACAGGCT TGACGCCGCG TGCAACTGGA 2280 
CGCGGGGAGA GCGC7GCGA7 TTGGACGACA GGGATCGGGC CGAGTTGAGC CCTCTGTTGC 2340 
TGTCCACTAC GCAATGGCAG GTCCTCCCCT GCTCATTCAC AACACTGCCC GCCCTGTCAA 2400 
CTGGCCTGAT ACATCTCCAC CAGAACATCG TGGACG7GCA GTACCTCTAT GGGTTGAGCT 2460 
CGGCAGTCAC ATCCTGGGTC ATAAAGTGGG AGTACGTTGT GC7CCTC7TC TTGCTGCTAG 2520 
CAGATGCTCG CATTTGTGCC TGCTTGTGGA TGATGCTTCT CA7A7C7CAG G7AGAGGCGG 2580 
CGCTGGAGAA CTTGATAGTT CTCAACGCTG CTTCCCTAGT CGGGACACAT GGCATCG1CC 2640 
CCTTCTTCAT CTTTTTT7GT GCAGCTTGGT ACCTAAAAGG CAAGTGGGCC CCTGGAC7CG 2700 
CCTATTCCGT C7A7GGGA7G 7GGCCAC7GC 7CC7GC77C7 CC7GGCG7TG CCCCAACGGG 2760 
CA7ACGCC77 GGA7CAGCAG 77GGCCGCG7 CG7G7GGCGC CACGG7C7TC A7C7GCC7AG 2820 
CGG7GC7CAC 7C7A7CGCCA 7A77ACAAAC AG7ACA7GGC CCGCGGCA7C 7GG7GGC7GC 2880 
AG7ACA7GC7 GACCAGAGCA GAGGCGC7CC 7ACAGG777G GG7CCCCCCG C7CAACGCCC 2940 
GAGGAGGGCG CGACGGAG7C G7AC7GCTCA CG7G7G7GC7 CCACCCGCAC TTGC7C777G 3000 
AAA7CACCAA GA7CA7GC7G GCCA77C7CG GGCC777G7G GATC77GCAG GCCAGTC7GC 3060 
TCAAGG7ACC G7AC77CG7G CGTGCCCACG G7C7CA77AG GCTCTGCATG C7GG7GCGCA 3120 
AGACAGCGGG CCG7CAG7A7 GfGCAGA7GG C7C7G77AAA GC7GGGAGCA 777GCCGGCA 3180 
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21 22 
CCTACATTTA CAACCACCTT TCCCCGCTCC AA6ACTGGGC TCACAGC6GT CTGCGCGACC 3240 
TGGCGGTAGC CACTGAACCC 6TCATCTTCT CCCGGATGGA AATCAAGACT ATCACCTGGG 3300 
GGGCGGATAC TGCGGCTTGT 66AGACATCA TCAACGGGCT GCCTGTTTCC GCCCGGAGAG 3360 
GGAGAGA6GT GTTGCTGGGA CCAGCCGATG CCCTGACTGA CAAAGGATGG AGGCTTTTAG 3420 
CCCCCATCAC GGCTTACGCC CA6CAGACAC GGGGTCTCTT GGGCTGCATC ATCACCAGCC 3430 
TCACCGGTCG GGACAAAAAT CAAGTGGAGG GGGAAGTCCA GATTGTGTCT ACCGCAACCC 3540 
AGACGTTCTT GGCTACTTGT GTTAATGGAG TTTGCTGGAC TGTGTATCAT QGGGCCGGAT 3600 
CGAGGACCAT CGCTTCG6CG TCGGGCCCT6 TGATCCAGAT GTACACTAAT GTGGACCAGG 3660 
ATTTGGTGGG CTGGCCAGCG CCTCAGGGAG CGCGCTCCCT GACGCCGTGC ACATGCGGCG 3720 
CCTCGGATCT GTACTTGGrC ACGAGGCACG CGGACGTCAT TCCAGTGCGG CGTCGGGGCG 3780 
ATAACAGGGG AAGTTTACTA TCTCCCCGGC CAATTTCATA TCTAAAG6GA TCCTCGGGAG 3840 
GCCCCCTGCT CTGTCCCATG GGACATGCCG TGGGCATTTT CAGGGCCGCG GTGTGCACCC 3900 
GTGGGGTCfiC AAAGGCGGTC GACTTTGTGC CCGTTGAATC CCTAGAGACC ACCATGAGGT 3960 
CCCCAGFGTT TACCGACAAT TCCAGCCCTC CGACAGTGCC CCAGAGCTAC CAGGTGGCGC 4020 
ATCTGCACGC TCCCAC7GGA AGTGGTAAGA GCACGAAGGT GCCGGCCGCC TATGCGGCTC 4080 
AAGGGTACAA GGTTCTTGTG CTGAACCCGT C1GTTGCT6C CACCCTAGGG TTCGGCGCTT 4140 
ATATGTCAAA GGCCCATGG6 ArTGACCCAA ACGTCAGGAC TGGCGTAAGG ACCA17ACCA 4200 
CAGGCTCCCC CATCACCCAC TCCACCTACG GCAAATTCCT GGCTGACGGT GGGTGTTCAG 4260 
GAGGTGCGTA TGACATCATA ATATGTGACG AATGTCACTC AGTGGACGCC ACCTCGATTC 4320 
TAGGCATAGG GACTGTCTTG GACCAAGCGG AGACAGCGGG GGTTAGGCTC ACTATCCTCG 4380 
CCACCGCTAC ACCACCTGGC TCCGTCACCG TGCCACATTC CAACATCGAG GAAGTTGCAT 4440 
TGTCCACTGA GGGGGAGATA CCATTCTATG GTAAGGCCAT CCCCCTAMT TACATCAAGG 4500 
GGGGGAGGCA TCTCATTTTC TGTCATTCCA AGAAGAAGrG CGACGAGCTC GCTGCAAAGC 4560 
TGGTTGGCCT GGGCGTCAAC GCAGTGGCCT ITTACCGCGG CCTCGACGTG TCTGTCATCC 4620 
CAACCACAGG AGACGTCGI7 GTTGTGGCGA CCGACGCCTT AATGACTGGC TACACCGGCG 4680 
ATTrCGACTC CGTGATAGAC TGCAACACCT GTGTCGTCCA GACAGTCGAT TTCAGCCTAG 4740 
ACCCTACATT CTCTATTGAG ACTTCCACCG TGCCCCAGGA CGCCGTGTCC CGCTCCCAAC 4800 
CGA6AGSTAG AACCGGTCGG GGGAAGCATG GTATCTACAG ATATGTGTCA CCCGGGGAGC 4860 
GGCCGTCTGG CATGTTTGAC TCCGTGGTCC TCTGTGAG1G CTATGACGCG GGTTGTGC17 4920 
GGTATGAGCT TACACCCGCC GAGACCACGG TTAGGTTACG 6GCATATCTT AACACCCCAG 4980 
GGTTGCCCGT GTGCCAGGAC CACTTGGAGT TTTGGGAGAG CGTCTTCACC GGCCTCACCC 5040 
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23 24 
ACATA6ATGC CCACTTCCTG TCTCAGACGA AACAGAGCGG GGAAAATTTC CCCTACCTAG 5100 
TC6CATACCA AGCCACC67G 76CGCTAGAG CTAAAGCTCC TCCCCCGTCA TGGGACCAAA 5160 
T6TGGAAGTG C77GA7ACGG CTCAAGCCCA CCCTCAC76G GGCTACCCCC CTACTATACA 5220 
GACTGQGTGG 767GCAGAA7 GAGATCACCC TAACACACCC AA7CACCAAG TACATCATGG 5280 
C7TG7A7GTC GGCTGACC7G GAGG7CGTCA CTAGCACGTG GGTGCTGGTG GGCGGCGTCC 5340 
TGGCCSC7TT GGCCGCTTAC TGCCT6TCTA CAG6CAGC67 GGTCATAGTG GGCAGGATAA 5400 
TCCTAAGCGG GAAGCCGGCA GTCATTCCTG ACAGGGAG6T TCTCrACCGA 6AG77TGA7G 5460 
AGA7GGAAGA GTGCGCCGCC CACATCCCC7 ACCTTGAGCA GGGGATGCAT TTGGCTGAAC 5520 
AGTTCAAGCA GAAAGC7C7C GGGTTGCTCC AGACAGCATC CAAGCAAGCA GAGACGATCA 5580 
CTCCCGCTGT CCATACCAAT TGGCAGAAAC TCGAGTCCTT CTflGGCTAAG CACATGTGGA 5640 
ACTTCGTCAG CGGGATACAA TACCTGGCGG GCCTGTCAAC GC7GCCCGGT AATCCCGCTA 5700 
TAGCGTCGCT GATGTCGTTT ACAGCCGCGG TGACGAGTCC AC7AACCACC CAGCAAACCC 5760 
7CC7C7TTAA CATCCTTGGG GGGTGGGTGG CCGCCCAGCT TGCCGCCCCA GC7GCCGCCA 5820 
CTGCTTTCG7 CGGCGC7GGT ATTACCGGCG CTGTCA7C6G CAG7GTGGGC CTAGGGAAGG 5880 
7CCTAGTGGA CATTCT7GCT GGC7AC6GGG CTGG7GTGGC GGGGGCCC77 GTGGC777CA 5940 
AGATCA7GAG CGGGGAGGCC CCCACCGCCG AGGA7CTAGT CAACCT7CTG CCTGCCAICC 6000 
TC7CGCCAGG AGCTCTCGT7 GTAGGCG7GG TGTGCGCAGC AATACTACGC CGGCACGTGG 6O60 
GCCC7GGCGA GGGCGCGG7G CAGTGGArGA ACCGACTGAT AGCGTTTGCT 7CTCGGGG7A 6120 
ACCACG7C7C CCCTACACAC TATG7GCCAG AGAGCGACGC G7CAG7CCGT GrCACACATA 6180 
7CCTCACCAG CCTCAC7G7C AC7CAGCTCC TGAAAAGGCT CCACG7G7GG A7AAGCTCAG 6240 
A7TGCACCGC CCCGTG7GC7 GGT7CTTGGC TCAAAGA7G7 C7GCGAC7GG A7A7GCGAGG 6300 
TGCTGAGCGA CTTCAAGAG7 7GGC7GAAGG CCAAAC77A7 GCCGCAACTG CCCGGGA7CC 6360 
CA7TCGTA7C CTG7CAACGC GGG7ACCG7G GGG7C7GGCG GGGCGAAGGC A7CA7GCACG 6420 
CCCGT7GCCC GTGTGGAGCC GA7A7AAC7G G7CA7G7CAA AAACGG77CG A7GAGAA7CG 6480 
7CGGCCC7AA GAC7TGCAGC AACACC7GGC G7GGG7CG7T CCCCA7CAAC GCCCACAC7A 6540 
CGGGCCC7TG CACACCC7CC CCAGCGCCGA ACTACACG7T CGCG77ATGG AGGGTGTCGG 6600 
CAGAGGAGTA 7G7GGAGG7A AGGCGGC7GG GGGATTTCCA TTACATCACG GGGGTGACCA 6660 
CTGATAAGA7 CAAGTG7CCA 7GCCAGG7CC CCTCGCCCGA G77CT7CACA GA6GTGGATG 6720 
GGG7GCGCCT ACA7AGGTAC GCCCCCCCC7 GCAAACCCCT GC7ACGGGA7 GAGG7GACGT 6780 
T7AGCA7CGG GCTCAATGAA TAC7TGGTGG GGTCCCAG7T GCCC7GCGAG CCCGAGCCAG 6840 
ACG7AGCTG7 ACTGACA7CA ATGC77ACAG ACCCCTCCCA CA7CACTGCA GAGACGGCGG 6900 
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25 26 
CGCGTAGACT GAATCGGGGG TCTCCCCCCT CCCTGGCTAG CTCTTCT6CC A6CCAATTGT 6960 

CTGCGCCGTC CCTGAAAGCA ACATGTACCA CCCACCATGA CTCTCCAGAC GCTGACCTCA 7020 

TAACAGCCAA CCTCCTGT6G AGGCAGGAGA TGGGGGGGAA CATTACCAGA GTGGAGTCGG 7080 

AGAATAAGAT CGTCATCCTG GATTCTTTCG ACCCGCTCGT GGCGGAGGAG GAFGATCGGG 7140 

AGATTTCTGT CCCAGCTGAG ATTCTGCTGA AGTCTAAGAA GTTTCCCCCC GCCATGCCTA 7200 

TATGGGCACG GCCAGATTAT AATCCTCCCC TTGTGGAACC ATGGAAGCGC CC6GACTACG 7260 

AACCACCCTT AGTTCACGGG TGCCCCCTAC CACCTCCCAA ACCAACTCCG GTGCCGCCAC 7320 

CCCGGAGAAA GAGGACAGTG GTGCTGGATG AGTCTACAGT ATCATCTGCT CTGGCTGAGC 7380 

TTGCCACTAA GACCTTTGGC AGCTCTACAA CCTCAGGCGT GACAAGTGGT GAAGCGGCCG 7440 

AATCQTCCCC 6GCGCCTTCC TGCGACGGTG AACTGGACTC CGAAGCTGAA TCTTACTCCT 7500 

CCATGCCCCC TCTCGAGGGG GAACCGGGGG ACCCCGATCT CAGCGACGGG TCTTGGTCTA 7560 

CCGTAAGCAG TGATGGCGGT ACGGAGGACG TCGTGTGCTG CTCGATGTCC TACTCGTGGA 7620 

CGGGCGCCCT AATTACGCCC TGTGCCGCAG AGGAAACCAA ACTCCCCATC AACGCACTGA 7680 

6TAACTCGCT GCTGCGCCAC CACAATTT6G TGTATTCCAC CACCTCTCGC AGCGCTGGCC 7740 

AGAGGCAGAA AAAAGTCACA TTTGACAGGC TGCAGGTCCT GGACGATCAT TACCGGGACG 7800 

TGCTCAAGGA GGCTAA6GCC AAGGCATCCA CAGTGAAGGC TAAATTGCTA TCCGTAGAGG 7860 

AGGCATGTAG CCTGACGCCC CCGCACTCCG CCAGATCAAA ATTTGGCTAT GGGGCGAAGG 7920 

ATGTCCGAAG CCATTCCAGT AAGGCCATAC GCCACATTAA C7CCGTGTGG CAAGACCTTC 7980 

TGGAGGACAA TACAACACCT ATAGACACTA CCATCATGGC AAAGAATGAG GTCTTCTGCG 8040 

TGAAGCCCGA AAAGGGGGGC CGCAAGCCCG CTCGTCTTAT CGTGTACCCC GACCTGGGAG 8100 

TGCGCGTATG CGAGAAGAGG GCTTTGTArG ACGTAGTCAA ACAGCTCCCC ATTGCCG7GA 8160 

TGGGAACCTC CTACGGGTTC CAGTACTCAC CAGCGCAGCG GGTCGACTTC CTGCTTAAT6 8220 

CGTGGAAATC AAA6AAAAAT CCTATG6GGT TTTCCTAT6A CACCCGTTGC TTTGACTCGA 8280 

CAGTCACTGA GGCTGATATC CGTACG6AGG AAGACCTCTA CCAATCTTGT GACCTGGTCC 8340 

CTGAGGCCCG CGCGGCCATA AGGTCTCTCA CAGAGAGGCT ITACAfCGGG GGCCCGCTTA 8400 

CCAACTCTAA GGGACAAAAC FGCGGCTATC GGCGATGCCG CGCGAGCGGC GTGCTGACCA 8460 

CTAGCTGCGG TAACACCATA ACCTGTTATC TCAAGGCCAG TGCAGCCTGT CGAGCTGCAA 8520 

AGCTCCGGGA CTGCACTATG CTCGTGTGCG GCGACGACCT CGTCGTTATC TGTGAGAGCG 8580 

CCGGTGTCCA GGAGGACGCT GCGAACCTGA GAGCCTTCAC GGAGGCTATG ACCAGGTACT 8640 

CCGCCCCCCC GGGAGACCCG CCTCAACCAG AATACGACTT GGAGCTTATA ACATCTTGTT 8700 

CCTCCAATGT TTCAGTCGCG CACGACGGCG CTGGCAAAAG GGTCTACTAT CTGACCCGTG 8760 
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(15) #ffl¥6 - 3 1 9 5 6 3 

21 28 

ATCCT6AGAC TCCCCTCGCG CGTGCCGCTT G6GAAACAGC AAGACACACT CCAGTGAACT 8820 

CCTGGCTAGG CAACATCATC ATGTTTGCCC CCACTCTGTG GGTACGGATG GTCCTTATGA 8880 

CCCATTTTTT CTCCATACTC ATAGCCCAGG AACACCTTGA AAA66CTCTA GATTGTGAAA 8940 

TCrATGGGGC CGTGCACTCC GTCCAACCGT TAGATCTACC TGAAATCAU CAAAGACTCC 9000 

ACGGCCTCAG CGCGTTCTCG CTCCATAGTT ACTCTCCAGG TGAAArCAAT AGGGTGGCTG 906O 

CATGCCTCAG GAAACTTGGG GTTCCGCCCT TGCGAGCTTG GAGACACCGG GCCCGGAGCG 9120 

TCCGCGCCAC ACTCCTATCC CAGGGGGGGA GA6CCGCTAT ATGCG6TAAG TATCTCTTCA 9180 

ACTGGGCGGT GAAAACCAAA CTCAAACTCA CTCCATTACC GTCCGCGTCT CAGTTGGACT 9240 

TGTCCAATTG GTTCACGGGC GGTTACAGCG GGGGAGACAT TTATCACAGC GTGTCTCATG 9300 

TCCG6CCCCG CTGGTTCTTC TGGTGCCTAC TCCTACTTTC AGTGGGGGTA GGCATT7ACC 9360 

TCCTTCCCAA CCGGTAGACG GTTGG6CAAC CACTCCAGGC CTTTAGGCCC TGTTTAAACA 9420 

CTCCAGGCCT TTAGGCCCCG TTTTTTTT7T TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 9480 

ttttttt mi 

[0 0 3 3] 
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(16) *JIB¥6-3 1 9 5 6 3 

29 30 

mitt : 2 

: 1 7 6 5 

ffiMtf)SUB : c D N A to genomic RNA (YS117-5' CDNA) 

CCATGGCGTT AGTATGAGTG TCGTGCAGCC TCCAGGACCC CCCCTCCCGG GAGAGCCATA 60 
GTGGTCTGCG GAACCGGTGA GTACACCGGA ATTGCCAGGA CGACCGGGTC CTTTCTTGGA 120 
TCAACCCGCT CAATGCCTGG AGATTTGGGC GTGCCCCCGC AAGACTGCTA GCCGAGTAGT 180 
GTTGGGTC6C GAAAGGCCTT GTGGTACTGC CTGATAGGGT GCTTGCGAGT GCCCCGGGAG 240 
GTCTCGTAGA CCGTGCACCA TGAGCACAAA TCCrAAACCT CAAAGAAAAA CCAAACGTAA 300 
CACCAACCGC CGCCCACAGG ACGTTAAGTT CCCGGGTGGC 6GCCA6ATCG TTGGCGGAGT 360 
TTACTTGTTG CCGCGCAGGG GCCCCAGAGT GGGTGTGCGC GCGACGAGGA AGACTTCCGA 420 
GCGGTCGCAA CCTCGCGGAA GGCGTCAGCC TATCCCCAAG GCCCGCCGAC CCGAGGGTAG 480 
GTCCTGGGCG CAGCCCGGG7 ACCCTTGGCC CCTCTATGGC AACGAGGGCT GCGGGTGGGC 540 
GGGATGGCTC CTGTCCCCCC GCGGCTCTCG GCCrAGTTGG GGCCCCACTG ACCCCCGGCG 600 
GAGGTCACGC AATTTGGGTA AGGTCATCGA TACTCTCACG TGTGGCTTCG CCGACCTCAT 660 
GGGGTACATC CCGCTCGTCG GTGCTCCTCT AGGGGGCGCT GCCAGAGCTC TGGCACACGG 720 
TGTTAGAGTT CTGGAAGACG GCGTGAACTA CGCAACAGGG AACCTTCCTG GTTGCTCCTT 780 
TTCfATCTTC TTGCTCGCTC TTCTATCCTG CCTGACAGTC CCTGCTrCGG CCGTCGAAGT 840 
GCGCAACTCA TCAGGGGrGT ACCATGTCAC CAATGATTGC CCCAATGCGT CCGTTGTGTA 900 
CGAGACAGAG AGCC7GATCA TGCATCTGCC CGGGTGTGTG CCCTGCGTAC GCGAGGGCAA 960 
CGCCTCGAGG TGCTGGGTCT CCCTTAGCCC TACCATTGCC GCTAAGGATC CGAGCGTCCC 1020 
CGTCAGTGAG ATTCGGCGTC ACGTCGACTT GATCGTCGGG GCCGCCGCGT TCTGTTCGGC 1080 
TATGTATGTA GGGGACCTAT GCGGCICCAT CTTCCTCGTT GGCCAGATTT TCACCTTCTC 1140 
TCCCAGGCGC CATTGGACGA CGCAGGACTG TAATTGCTCC ATCTACCCGG GCCATGTGAC 1200 
AGGTCATCGA ATGGCTTGGG ACATGATGAT GAATTGGTCA CCCACTGGCG CCCTAGTGAT 1260 

GGCGCAGCTA CTCCGGATCC CACAAGCTGT CGTGGATATG ATAGCCGGTG CCCACTGGGG 1320 

TGTCCTAGCG GGCCTGGCAT ACTACTCCAT GGTGGGGAAC TGGGCTAAGG TTG7GGTTGT 1380 

GCTGTTGCTC TTCGCTGGCG TCGACGCGGA CACCCAGGTC ACAGGAGGCA GCGCTGCCTA 1440 

TGATGCGCGC GGACTTGCTT CCCTTTTCAC CCCAGGCCCT AAGCAAAACA 1CCAGCTCAT 1500 

AAATACCAAC GGCAGCTGGC ACATCAACAG GACCGCCTTG AACTGTAATG AAAGCCTGAA 1560 

CACCGGCTGG GTAGCAGGCC TGrTCTACTA TCACAAATTC AACTCCTCGG GGTGTCCTGA 1620 

GAGGATGGCT AGTTGCCAGC CCCTCACCGC CTTTGACCAA GGGTGGGGAC CCATCACTTA 1680 

CGAGGGGAAT GCTAGCGGCG ACCAACGGCC ATATTGCT6G CACTATGCCC CACGCCCGTG 1740 

CGGTATTGTG CCGGCGAGAG AGGTT 1765 
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(17) ^08^6-3 1 9 5 6 3 

31 32 

[0 0 3 4] 

Efl#* : 3 

E*lfO££ .119 1 

mmm. ■. mm 

: -*g 

WS^WS. : c D N A to genomic RNA (YS117-3' cDNA) 

AACAG7CAC7 GAGGC7GATA TCCGTACGGA GGAAGACCTC TACCAATCT7 GTGACCTGGT 60 

CCCCGAGGCC CGCACGGCCA TAAGGTCTCT CACAGAGAGG CTTTACATCG GGGGCCCCCT 120 

7ACCAAT7CC AAG6GACAAA ACTGCGGCTA fCGGCGATGC CGTGCAAGCG GCGTGCTGAC 160 

CACTAGCTGC GGTAACACCA TAACCTGTTA TCTCAAGGCC AGCGCAGCCT GTCGAGCTGC 240 

AAAGCTCCAG GACTGCACCA TGCTCGTGTG CGQCGACGAC CTCGTCGTTA TCrGTGAGAG 300 

CSCCGGTGTC CAGGAGGACG CTGCGAGCCT GAGAGCCTTC ACGGAGGCTA 7GACCAGGTA 360 

CTCCGCCCCC CCGGGAGACC CGCCTCAACC AGAATACGAC TTGGAGCTCA TAACATCCTG 420 

TTCCTCCAAC GTGTCAGTCG CGCACGACGG CTCTGGCAAA AGGGTCTACf ATCTGACCCG 480 

T6ATCCTGAG ACTCCCCTCG CGCGTGCCGC TTGGGAGACA GCAAGACACA CTCCAGTGAA 540 

CrCCTGGCTA GGCAACATCA TCATGTTTGC CCCCACTCTG TGGGTACGGA TGGTTCTTAT 600 

GACCCATTTT TTTTCCATAC TCATAGCCCA GGAGCACCTT GAAAAGGCTC TAGATTGTGA 660 

AATCTATGGA GCCGTACAC1 CCGTCCAACC GCTGGACCTA CCTGAAATCA TTCAAAGACT 720 

CCACGGCCTC AGCGCGTTTT CGCTCCACAG T7ACTCTCCA GGTGAAATCA ATAGGGTGGC 780 <- 

TGCATGCCTC AGAAAACTTG GGGTTCCGCC CTTGCGAGCT TGGAGACACC GGGCCCGGAG 640 

CGICCGCGCC ACACTCCTAT CCCAGGGGGG GAAAGCTGCT ATATGCGGTA AGTACCTCTT 300 

CAACTGGGCG GTGAAAACCA AACTCAAACT CAC7CCA7TA CCGTCCGCGT CTCAGTTGGA 960 
CTTGTCCAAT rGGTTCACGG GCGGCTACAG CGGGGGAGAC ATTTATCACA GCGTGTCTCA 1020 
TGTCCGGCCC CG77GG77C7 TCTGGTGCCT ACTCCTACTT TCAGTGGGGG TAGGCATCTA 1060 
7C7CC77CCC AACCGATAGA CGGTTGGGCA ATCACTCCTA GCCTTTAGGC CTTATTTAAA 1U0 
CACTCCAGGC CTTTAGGCCC TGTTTTTT7T 77777T777T 7T7T7T7777 7 1191 

[0 0 3 5] 
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(18) *JDB¥6-3 1 9 5 6 3 

33 34 

: 4 

&m<7)&$ : 1 7 6 5 

WMam : WSt 

: 

E^SS : cDNA to genomic RN A (SR037-S' cOHA) 

CCATGGCGTT ACTATGAfiTG TCGTGCAGCC TCCAGGACCC CCCCTCCCGG GAGAGCCATA 60 

GTGGTC7GCG GAACCGGTGA GTACACCGGA ATTGCCAGGA CGACCGGGTC CTTTCrTGGA 120 

TTAACCCGCT CAATGCC7GG AGATTTGGGC GTGCCCCCGC AAGACTGCTA GCCGAGTAGT 1*0 

GTTGGG7CGC GAAAGGCCTT GTGGTACTGC CTGATAGGGT GCTTGCGAGT GCCCCGGGAG 240 

GTCTCGTAGA CCGTGCACCA TGAGCACGAA TCCTAAACCT CAAAGAAAAA CCAAACGTAA 300 

CACCAACCSC CGCCCACAGG ACGTCAAGTT CCCGGGTGGC GGCCAGATCG TTGGCGGAGT 360 

TTACTTGTT6 CCGCGCAGGG GCCCCAGAAT G6G7GTGCGC GCGACGAGGA AGACTTCCGA 420 

GCGGTCGCAA CC7CGCGGAA GGCGTCAGCC TATTCCCAAG GCCCGCCGAC CCGAGGGTAG 480 

6TCCT6GGCG CAGCCCGGGT ACCCTTGGCC CCTCTATGGT AACGAGGGCT GTGGGTGGGC 540 

GGGATGGC7T CTGTCCCCCC GCGGTTCCCG GCCTAGTTGG GGCCCCTCTG ACCCCCGGCG 600 

GAGGTCACGC AACTTGGGTA AGGTCATCGA TACCCTCACG TGTGGCTTCG CCGACCTCAT 660 

GGGGTACATC CCGCTCGTCG GTGCTCCTTT AGGGGGCGCT GCCAGAGCTC TGGCGCATGG 720 

TGTCAGAGn CTGGAAGACG GCGTGAATTA TGCAACAGGG AACCTrCCCG GTTGCTCTTT 780 

TTCTATCTTC TTGCTTGCCC TTCTATCCTG CCTGACAGTC CCTGCTTCGG CCGTCGGAGT 840 

GCGCAACTCT TCGGGGGTG7 ACCATGTCAC CAATGATTGC CCCAATGCGT CTGTTGTGrA 900 

CGAGACAGAG AGCCTGATCA TACATCTGCC CGGGTGTGTG CCCTGCGTAC GCGAGGGCAA 960 

CGCCTCGAGG TGCTGGGTCT CCCTTAGTCC TACIGTTGCC GCTAAGGATC CGAGCGTCCC 1020 

CGTCAGTGAG ATTCGACGCC ATGTCGACCT GATTGTCGGG GCCGCTGCGT TCTGTTCGGC 1080 

TATGTACGTA GG6GACCTAT GCGGCTCCAT CTTCCTCGTT GGCCAGATTT TCACCCTCTC 1140 

TCCCAGGCGT CACTGGACGA CGCAGGACTG TAATTGTTCC ATCTACCCAG GCCATGTGAC 1200 

AGGTCATCGA ATGGCTTGGG ACATGATGAT GAATTGGTCA CCTACTGGCG CCCTAGTGGT 1260 

GGCGCAGCTA CTCCGGATCC CACAAGCTGT CGTGGATATG ATAGCCGGTG CCCACTGGGG 1320 

TGTCCTAGCG GGCCTGGCAT ACTATTCCAT GGTGGGGAAC TGGGCTAAGG TTGTGGTTGT 1380 
GCTGCTACTT TTTGC7GGCG TCGA7GCAGA GACCCAGGTC TCAGGAGGCT CCGCTGCCCA 1440 

AACCACGTAC GGTCTTACTG CCCTCTTCAG GACAGGCCCT AATCAAAAAA TCCAGCTCAT 1500 

AAATACCAAC GGCAGCTGGC ATATCAACAG GACCGCCTTG AAC7GTAATG AGAGCTTGCA 1560 

CACCGGCTGG CTGGCAGCGC 7GTTCTACAC CCACAAGTTC AACTCTTCGG GGTGTTTGGA 1620 

GAGGATGGCC AGTTGCCAGC CTCTTTCCGC CT7CGACCAA GGGTGGGGAC CCATCACTTA 1680 

CGGGGGGAAT GCTAGCGACG GCCAACGGCC ATATTGCTGG CACTATGCCC CACGCCCGTG 1740 

CGGTA77G7G CCGGCGAGAG AGGT7 1765 
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(19) #$¥6-3 1 9 5 6 3 

35 36 

[00 36] 

W3m^ : 5 
fflil<7)&Z : 1 1 7 9 
mi<7>M : 

cDNA to genomic RNA (SR037-3* cONA) 

AACAG7CAC7 GAGGC7GA7A TCCGCACGGA GGAAGACCTC TACCAATCTT G7GACCTGG7 60 
CCCTGAGGCC CGCACGGCCA TAAGGTCCCT CACAGAGAGG CTTTACATCG GGGGCCCGCT 120 
TACCAATTCT AAGGGACAAA ACTGCGGCTA TCGGCGATGC CGCGCAAGCG GCGTGCTGAC 180 
CACrAGCTGC GGTAACACCA TAACCTGTTA TCTCAAGGCC AGTGCAGCCT GTCGAGCTGC 240 
AAAGCTCCGG GACTGCACTA TGCTCGTGTG CGGCGATGAC CTTGTCGTTA TCTGTGAGAG 300 
CGCCGGTGTC CAGGAGGACG CTGCGAGCCT GAGAGCCTTC ACGGAGGC7A TGACCAGGTA 360 
CTCTGCCCCC CCG6GAGACC CGCCTCAACC AGAATACGAC TTGGAGCTTA TAACATCCTG 420 
TTCCTCCAAT G7G7CAGTCG CGCACGACGG CGC7GGCAAA AGGGTCTACT ATCTGACCCG 480 
TGATCCTGAG ACCCCCCTCG CGCGTGCCGC TTGGGAGACA GCAAGACACA CTCCAGTGAA 540 
CTCCT66CTA GGCAACATCA TTATGTTTGC CCCCACTTT6 TGGGTAC6GA TGGTCCTCAT 600 
6ACCCATTTT TTCTCCATAC TCATAGCCCA GGAGCACCTT GAAAAGGCTC 7AGA77GT6A 660 
AATCTATGGA GCCGTACACT CCATCCAACC GCTGGACCTA CCTGAAATCA TTCAAAGACT 720 
CCACGGCCTC AGCGCGTT7T CGCTCCACAG TTAC7CTCCA GGTGAAATCA ATAGGGTGGC 780 
TGCATGCCTC AGAAAACTTG GGGTTCCGCC CTTGCGAGCT TGGAGACACC GGGCCCGGAG 840 
CGTCCGCGCC ACACTCCTAT CCCAGGGGGG GAAAGCCGCT ATATGCGGTA AGTACCTCTT 900 
CAACTGGGCG GTGAAAACCA AACTCAAACT CACrCCAFTA CCGTCCGCGf CTCAGTTGGA 960 
CTTQTCCAAT 7GGTTCACGG GCGGC7ACAA CGGGGGAGAC A77TATCACA GCG7GTC7CG 1020 
7G7CCGGCCC CG77GG77C7 7C7GGTGCC7 ACTCC7ACrC 7CAG7GGGGG 7AGGCATC7A 1080 
7CTCCT7CCC AACCGA7AGA CGG77GGGTA A7CAC7CCAA GCCT77AGGC CC777T7AAA 1140 
CACTCCAGGC C7T77GGCCC 7Gr7777T77 777717777 1179 



C0 0 3 7] mmmn: 6 
mma&z : 2 0 

mWOM : 

mom : -*sa 

EJWOfflg : cDNA to genomic RNA 

tmz&fevttm : E 
eju (#321) 

AACCTCCGCC GGGGATCAGA 2 0 

[0038] mmn-. i 

gj&MD&Z : 2 0 



40 b-$ai>--.m%m 

E?iJ©aS:cDNA to genomic RNA 

im&&%.Lttm: e 

E?»J (#18 6) 

AYGTACCCCA YGAGRTCGGC 20 

(YteTS/tttC . R\tG£fc\ZA) 

[0 0 3 9] StPMH: 8 
EJiJOfiS : 2 0 

mwam : m. 
so : mm 



—807— 



(20) 

37 

&ffl<7>ffl§i: cDNA to genomic RNA 

mm (#256) 

CGCGCGMCNA GGAARRCTTC 2 0 
(MteA££[2C o NttA, T. CifctiG . R\t 

AJ&ttG) 
[0 0 4 0] EH**: 9 

&m<D&2 : 2 0 

&m<DM : mm 

h#ay- : SMI* 

fSfflftWSk : cDNA to genomic RNA 
tttteWEl/to&tt : E 
ffijtj (#10 4) 

AGRAARRCTT CSGAGCGRTC 20 

(RfctGSfcttA » SttCS&ttG . ) 

[0 0 4 1] K3*J#^: 10 
K^lOgS : 2 0 
m\V>M : 

aso» : 20 

K^JOSS : cDNA to genomic RNA 
WaftSteLfc^tt : E 
BSfl (#13 2) 

YRC CTTGGG C AT AGGCTGAC 20 
(YttT*fcttC o RttGfcfcttA . ) 

[0042] &&mn: 1 1 

EJtJOg£ : 2 0 

: 3> 

BJ»J©«H:cDNA to genomic RNA 
4*»£»£U&*ife : E 
E5U (#1 3 3) 



1*R!¥6-3 19 563 

GARCCAWCCT GCCCAYCCYA 20 
(RfiGSfcttA . WttTSftllA . YttC*&« 
T . ) 

[0043] smt^z 1 2 

tiffl<D&Z : 2 0 

: 

®<D& : -*« 

E#JG0SS:cDNA to genomic RNA 
ffikZikfeLttm: E 
EJB (#13 4) 

CCAARAGGGA CGGGARCCTC 2 0 
(RHGSfcttA o ) 

[0 0 4 4] m$m^: 1 3 

E50<O*S : 2 0 
BH®ai : 

E3*JCDfiS: cDNA to genomic RNA 
♦»*«jeLfc3SFa5: E 

um (#135) 

RCCYTCGTTT CCRTACAGRG 2 0 
(RttGSfcttA . YttCSfcttT . ) 

[0045] mm^z 1 4 

E?*J<ag2 : 2 0 

e^ios : m 

E5W!>S»:cDNA to genomic RNA 
E 

E3*J (#2 9 6) 

GGATAGGCTG ACGTCTACCT 20 
[004 6] 
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(21) ftS¥6-3 1 9 5 6 3 

39 40 
lEWS-f : 1 5 
K#f<7*g£ : 3 0 1 1 

: (HC-G9 aaino acid) 



Het Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr 
5 10 15 

Asn Arg Arg Pro Gin Asp Val Lys Phe Pro Giy GJy Giy Gin lie 
20 25 30 

Val Giy Giy Val Tyr Leu Leu Pro Arg Arg Giy Pro Arg Val Giy 
35 40 45 

Val Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Giy 
50 55 60 

Arg Arg Gin Pro fie Pro Lys Aia Arg Arg Pro Glu Giy Arg ser 
65 70 75 

Trp Ala Gin Pro Giy Tyr Pro Trp Pro Leu Tyr Giy Asn Glu Giy 
80 85 90 

Cys Giy Trp Ala Giy Trp Leu Leu Ser Pro Arg Giy Ser Arg Pro 
95 100 105 

Ser Trp Giy Pro ser Asp Pro Arg Arg Arg Ser Arg Asn Leu Giy 
110 115 120 

Lys Val lie Asp Thr Leu Thr Cys Giy Phe Ala Asp Leu Het Giy 
125 130 135 

Tyr [le Pro Leu Val Giy Ala Pro Leu Giy Giy Aia Ala Arg Aia 
140 145 150 

Leu Ala His Giy Val Arg Val Leu Giu Asp Giy Val Asn Tyr Aia 
155 160 165 

Thr Giy Asn Leu Pro Giy Cys Ser Phe Ser lie Phe Leu Leu Aia 
170 1 75 1 80 

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Aia Val Giy Vai Arg 
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(22) 1$M¥6-3 1 9 5 6 3 

41 42 

185 190 195 

Asn Sep Ser Gly Val Tyr His Val Thr Asn Asp Cys Pro Asn Ala 

200 205 210 

Ser Val Val Tyr Glu Thr Giu Asn Leu He Het His Leu Pro Gly 
215 220 225 

Cys Val Pro Tyr val Arg Glu Gly Asn Ala ser Arg cys Trp Val 

230 235 240 

Ser Leu Ser Pro Thr Val Ala Ala Arg Asp Ser Arg Val Pro Val 

235 240 245 

Ser Glu Val Arg Arg Arg Val Asp Ser He Val Gly Ala Ala Ala 

250 255 260 

Phe Cys Ser Ala Het Tyr Val Gly Asp Leu Cys Gly Ser He Phe 

265 270 275 

Leu Vat Gly Gin He Phe Thr Phe Ser Pro Arg His His Trp Thr 

280 285 290 

Thr Gin Asp cys Asn Cys Ser He Tyr Pro Gly His Val Thr Gly 

295 300 305 

His Arg Het Ala Trp Asp Met Het Met Asn Trp Ser Pro Thr Gly 

315 320 325 

Ala Leu Val Val Ala Gin Leu Leu Arg He Pro Gin Ala He Val 

330 335 340 

Asp Het lie Ala Gly Ala His Trp Gly Val Leu Ala Glv Leu Ala 

345 350 355 

Tyr Tyr Ser Hec Val Gly Asn Trp Ala Lys Val Val Val Val Leu 

360 365 3 70 

Leu Leu Phe Ala Glv Val Asp Ala Glu Thr Arg Val Thr Gly Gly 

375 380 385 

Ala Ala Gly His Thr Ala Phe Gly Phe Ala ser Phe Leu Ala Pro 

390 395 400 

Gly Ala Lys Gin Lys lie Gin Leu He Asn Thr Asn Gly Ser Trp 

405 410 415 
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(23) ftffl¥6-3 1 9 56 3 

43 44 

His He Asn Arg Thr Ala' Leu Asn Cys Asn 6lu Ser Leu Asp Thr 

420 425 430 

Gly Trp Leu Ala Gly Leu Leu Tyr Tyr His Lys Phe Asn Ser Ser 

435 440 445 

Gly Cys Pro Glu Arg Het Ala Ser Cys Gin Pro Leu Thr Ala Phe 

450 455 460 

Asp Gin Gly Trp Gly Pro He Thr His Glu Gly Asn Ala Ser Asp 

465 470 475 

Asp Glit Arg Pro Tyr Cys Trp His Tyr Ala Leu Arg Pro Cys Gly 

480 485 490 

He Val Pro Ala Lys Lys Val Cys Gly Pro Val Tyr Cys Phe Thr 

495 500 505 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ala Gly Val Pro 

510 515 520 

Thr Tyr Arg Trp Gly Ala Asn Glu Thr Asp Val Leu Leu Leu Asn 

525 530 535 

Asn Ser Arg Pro Pro Het Gly Asn Trp Phe Gly Cys Thr Trp Hec 

540 545 550 

Asn Ser Ser Gly Phe Thr Lys Thr Cys Gly Ala Pro Ala Cys Asn 

555 560 575 

He Gly Gly Ser Gly Asn Asn Thr Leu Leu Cys Pro Thr Asp Cys 

5 70 5 75 580 

Phe Arg Lys His Pro Asp Ala Thr Tyr Ser Arg Cys Gly Ser Gly 

585 590 595 

Pro Trp Leu Thr Pro Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu 

600 605 610 

Trp His Tyr Pro Cys Thr Val Asn Tyr Thr lie Phe Lys He Arg 

615 620 625 

Hec Phe Val Gly Gly Val Glu His Arg Leu Asp Ala Ala Cys Asn 

630 635 640 

Trp Thr Arg Gly Glu Arg Cys Asp Leu Asp Asp Arg Asp Arg Ala 
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(24) 8>HJ¥6-3 1 9 5 6 3 

45 46 

«0 655 660 
filu Leu See Pro Leu Leu Leu Ser Thr Thr Gin Trp Gin Val Leu 

665 670 675 
Pro Cys Ser J>he Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu Me 

m 685 690 

His Leu His Gin Asn Me Val Asp Val Gin Tyr Leu Tyr Gly Leu 

695 TOO 70s 

Ser Ser Ala Val Thr Ser Trp Val lie Lys Trp Glu Tyr Val Val 

710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg ire Cys Ala cys Leu 

725 730 735 

Trp Het Het Leu Leu He Ser Gin Val Glu Ala Ala Leu Glu Asn 

740 745 7 5 o 

Leu He Val Leu Asn Ala Ala Ser Leu Val Gly Thr His Gly lie 
.755 760 765 

Val Pro Phe Phe He Phe Phe Cys Ala Ala Trp Tyr Leu Lys Gly 
™ 775 780 

Lys Trp Ala Pro Gly Leu Ala Tyr Ser Val Tyr Gly Het Trp Pro 
785 790 795 

Leu Leu Leu Leu Leu Leu Ala Leu Pro Gin Arg Ala Tyr Ala Leu 
MO 805 8io 

Asp Gin Glu Leu Ala Ala Ser Cys Gly Ala Thr Val Phe He Cys 
J10 815 &20 

Leu Ala Val Leu Thr Leu Ser Pro Tyr Tyr Lys Gin Tyr Het Ala 
"0 835 440 

Arg Gly He Trp Trp Leu Gin Tyr Het Leu Thr Arg Ala Glu Ala 
845 850 355 

Leu Leu Gin Val Trp Val Prop ro Leu Asn Ala Arg Gly Gly Arg 

860 865 3 

Asp 6ly Val Val Leu Leu Thr Cys Val Leu His Pro His Leu Leu 
885 890 895 
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(25) 1#H¥6-3 1 9 56 3 

47 48 

Phe Glu Ne Thr Lys lie Met leu Ala lie Leu Gly Pro Leu Trp 
900 S05 910 

He Leu Gin Ala Ser Leu Leu Lys Yal Pro Tyr Phe Val Arg Ala 
915 g20 925 

His Gly Leu He Arg leu Cys Het Leu Val Arg Lys Thr Ala Gly 
930 935 940 

Gly Gin Tyr va IGIn Her. Ala Leu Leu Lys Leu Gly Ala Phe Ala 
945 950 955 

Gly Thr Tyr fie Tyr Asn His Leu Ser Pro Leu Gin Asp Trp Ala 
960 965 970 

His Ser Gly Leu Arg Asp Leu Ala Val Ala Thr Glu Pro Val He 
975 9S0 985 

Phe Ser Arg Het Glu He Lys Thr He Thr Trp Gly Ala Asp Thr 
990 $95 iflflo 

Ala Ala Cys Gly Asp He He Asn Gly Leu Pro Val Ser Ala Arg 
1005 1010 1015 

Arg Gly Arg Glu Val Leu Leu Gly Pro Ala Asd Ala Leu Thr Asp 
1«20 1025 1030 

Lys Gly Trp Ara Leu Leu Ala Pro He Thr Ala Tyr Ala Gin Gin 
1035 1040 J045 

Thr Arg Gly Leu Leu Gly Cys lie He Thr Ser Leu Thr Gly Arg 
1050 1055 10S0 

Asp Lys Asn Gin Val Gl uGly Glu Val Gin He Val Ser Thr Ala 
1065 1070 1075 

Thr Gin Thr Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr 
1080 1085 1090 

Val Tyr His Gly Ala Gly Ser Arg Thr lie Ala Ser Ala Ser Gly 
1095 1100 1105 

Pro Val He Gin net Tyr Thr Asn Val Asp Gin Asp Leu Val Gly 
1110 11J5 1120 

Trp Pro Ala Pro Gin Gly Ala Arg ser Leu Thr Pro Cys Thr Cys 
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(26) 1$B!¥6-3 1 9 5 6 3 

49 50 

1025 1030 1035 

Gly Ala Ser Asp Leu Tyr Leu val Thr Arg His Ala Asp Val He 

1040 1045 1050 

Pro Val Arg Arg Arg Gly Asp Asn Arg Gly Ser Leu Leu Ser Pro 

1055 1060 1065 

Arg Pro II* Ser Tyr Leu Lys Gly Ser ser Gly Gly Pro Leu Leu 

1070 1075 1080 

Cys Pro Het Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 

1085 1090 1095 

Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser 

1100 1105 1110 

leu Glu Thr Thr Het Arg Ser Pro Val Phe Thr Asp Asn Ser Ser 

1115 1120 1125 

Pro Pro Thr Val Pro Gin Ser Tyr Gin val Ala His Leu His Ala 

1130 1135 1140 

Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala 

1145 1150 1155 

Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 

1160 1165 1170 

Thr Leu Gly Phe Gly Ala Tyr Mec Ser Lys Ala His Gly Me Asp 

1175 1130 1135 

Pro Asn Val Arg Thr Gly Val Arg Thr He Thr Thr Gly Ser Pro 

1190 1195 1200 

lie Thr His Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys 

1205 1210 1215 

Ser Gly Gly Ala Tyr Asp He He He Cys Asp Glu Cys His Ser 

1225 1230 1235 

Val Asp Ala Thr Ser He Leu Gly He Gly Thr Val Leu Asp Gin 

1240 1245 1250 

Ala Glu Thr Ala Gly Val Arg Leu Thr lie Leu Ala Thr Ala Thr 

1255 1260 1265 
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(27) 4$M¥6-3 1 9 5 6 3 

51 52 

Pro Pro Glv Ser Val Thr Val Pro His Ser Asn He Glu 6!u Val 

1270 1275 1280 

Ala Leu Ser Thr Glu Gly Glu He Pro Phe Tyr 6ly Lys Ala He 

1385 1290 1295 

Pro Leu Asn Tyr Me Lys Gly Gly Arg His Leu lie Phe Cys His 

1300 1305 1310 

Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Val Gly Leu 

1315 1320 1325 

Gly val Asn Ala Val Ala Phe Tyr Arg Gly Leu Asp Val Ser Val 

1330 1335 1340 

lie Pro Thr Thr Gly Asp Val Val Val Val Ala Thr Asp Ala Leu 

1345 1350 1355 

et Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn 

1360 136S 1370 

Thr Cys Val Val Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe 

1375 13S0 1385 

Ser lie Glu Thr Ser Thr Val Pro Gin Asp Ala Val Ser Arg Ser 

1390 1395 1400 

Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys His Gly He Tyr Arg 

1405 1410 1415 

Tyr Val Ser Pro Gly Glu Arg Pro Ser Gly Het Phe Asp Ser Val 

1420 1525 1430 

Val Leu Cys Glu CysTyr Asp Ala Gly Cys Ala Trp Tyr Gfu Leu 

1435 1440 1445 

Thr Pro Ala Gfu Thr Thr Val Arg Leu Arg Ala Tyr Leu Asn Thr 

1450 1450 1455 

Pro Glv Leu Pro Val Cys Gin Aso His Leu Glu Phe Trp Glu Ser 

1465 1470 1475 

Val Phe Thr Gly Leu Thr His He Asp Ala His Phe Leu Ser Gin 

1480 1485 1490 

Thr Lys Gin Ser Gly Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gin 
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53 54 

1495 1500 15 os 

Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro Pro Ser Trp Asp 
1510 1515 1520 

Gin Het Trp Lys Cys Leu He Arg Leu Lys Pro Thr Leu Thr Gly 
1525 1530 1535 

Ala Thr Pro Leu Leu Tyr Arg Lew Sly Gly Val Gin Asn Glu He 
1M0 1545 1550 

Thr Leu Thr His Pro He Thr Lys Tyr He Het Ala Cys Met Ser 
1555 1560 1565 

Ala Asp Leu Glu Val Val Thr Ser Thr Trp val Leu Val Gly Gly 
15 N> 1575 1580 

Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Thr Gly ser Val 
1585 1590 1595 

Val lie Val Gly Arg He He Leu Ser Gly Lys Pro Ala Val lie 
1M0 1605 1610 

Pro Asp Arg Glu Val Leu Tyr Arg Glu Phe Asp Gfu Het Glu Glu 
1615 1620 1625 

Cys Ala Ala His He Pro Tyr Leu Glu Gin Gly Het His Leu Ala 
"30 1635 1S40 

Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Ser 
164 5 1650 1655 

Lys Gin Ala Glu Thr He Thr Pro Ala Val His Thr Asn Trp Gin 
1660 1665 1670 

Lys Leu Glu ser Phe Trp Ala Lys His Het Trp Asn Phe Val Ser 
1675 1680 1685 

Gly He Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro 
1690 1695 1700 

Ala lie Ala Ser Leu Het Ser Phe Thr Ala Ala Val Thr Ser Pro 
"05 1710 1715 

Leu Thr Thr filn Gin Thr Leu Leu Phe Asn He Leu Gly Gly Trp 
1720 1725 1730 
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55 56 

Val Afa Ala Gin Leu Ala Ala Pro Ala Ala Ala Thr Ala Phe Val 

1735 1740 1745 

Gly Ata Gly He Thr Gly Ala Val He Gly Ser Val Gly Leu Gly 

1750 1755 1760 

Lys Val Leu Val Asp Ne Leu Ala Gly Tyr Gly Ala Gly Val Ala 

1765 1770 1775 

Gly Ala Leu Val Ala Phe Lys He Met Ser Gly Glu Ala Pro Thr 

1780 1715 1790 

Ala Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro Gly 

1795 1300 1805 

Ala Leu Val Val Gly Val Vat Cys Ala Ala He Leu Arg Arg His 

1810 1815 1820 

Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He 

1825 1830 1835 

Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val 

1840 1845 1850 

Pro Glu Ser Asp Ala Ser Val Arg Val Thr His He Leu Thr Ser 

1855 1860 1865 

Leu Thr Val Thr Gin Leu Leu Lys Arg Leu His Val Trp lie Ser 

1870 1875 1880 

Ser Asp Cys Thr Ala Pro Cys Ala Gly Ser Trp Leu Lys Asp Val 

1885 1890 1895 

Trp Asp Trp He Cys 6lu Val Leu Ser Asp Phe Lys Ser Trp Leu 

1900 1905 1910 

Lys Ala Lys Leu Ket Pro Gin Leu Pro Gly He Pro Phe Val Ser 

1915 1920 1925 

Cys Gin Arg Gly Tyr Arg Gly Val Trp Arg Gly Glu Gly He Hec 

1930 1935 1940 

l!is Ala Arg Cys Pro Cys Gly Ala Asp Ne Thr Gly His Val Lys 
1945 1950 1955 

Asn Gly Ser Hec Arg He Val Gly Pro Lys Thr Cys Ser Asn Thr 
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57 58 

I960 1965 1970 

Trp Arg Gly Ser Phe Pro He Asn Ala His Thr Thr Gly Pro Cys 
1975 19*0 1985 

Thr Pro Ser Pro Ala Pro Asn Tyr Thr Phe Ala leu Trp Arg Val 
1990 1995 2000 

Ser Aia Glu Glu Tyr Val Glu Val Arg Arc Leu Gly Asp Phe His 
2005 2010 2015 

Tyr He Thr Gly Val Thr Thr Asp Lys lie Lys Cys Pro Cys Gin 
2020 2025 2030 

Val Pro Ser Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu 
2035 2040 2045 

His Arg Tyr Ala Pro Pro Cys Lys Pro Leu Leu Arg Asp Glu Val 
2050 2055 2360 

Thr Phe Ser lie Gly Leu Asn Glu Tyr Leu Val Gly Ser Gin Leu 
2065 2070 2375 

Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Hec Leu 
2080 2045 2390 

Thr Asp Pro Ser His He Thr Ala Glu Thr Ala Ala Arg Argleu 
2005 2040 2445 

Asn Arg Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin 
2110 2115 2420 

Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Thr His His Asp 
2125 2130 2435 

Ser Pro Asp Ala Asp Leu lie Thr Ala Asn Leu Leu Trp Arg Gin 
2140 2145 2450 

Glu Met Gly Gly Asn fie Thr Arg Val Glu Ser Glu Asn Lys He 
2155 2160 2465 

Val He Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Asp 
2170 2175 2480 

Arg Glu He Ser Val Pro Ala Glu He Leu Leu Lys Ser Lys Lys 
2185 2190 2495 
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59 60 

The Pro Pro Ala Met Pro lie Trp Ala Arg Pro Asp Tyr Asn Pro 
Pro Lou Val Glu Pro Trp Lys Arg Pro Asp Tyr Glu Pro Pro Leu 
Val His Gly Cys Pro Leu Pro Pro Pro Lys Pro Thr Pro Val Pro 
Pro Pro Arg Arg Lys Arg Thr Val Val Leu Asp Glu Ser Thr Val 
Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser 
Thr Thr Ser Gly Val Thr Ser Gly Glu Ala Ala Glu Ser Ser Pro 
Ala Pro Ser Cys Asp Gly Glu Leu Asp SerGiu Ala Glu Ser Tyr 
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Asp Gly Gly Thr Glu 
Asp Val Val Cys Cys Ser Het Ser Tyr ser Trp Thr Gly Ala Leu 
He Thr Pro Cys Ala AlaG lu Glu Thr LysLeu Pro Me Asn Ala 
Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
Thr Ser Arg Ser Ala Gly Gin Arg Gin Lys Lys Val Thr Phe Asp 
Arg Leu Gin Val Leu Asp Asp His Tyr Arg Asp Val Leu Lys Glu 
Ala Lys Ala Lys Ala Ser Thr Val Lys AlaLys Leu Leu Ser Val 
Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Arg Ser Lys 
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61 62 

Phe Gly Tyr Gly Ala Lys Asp Val Arg Ser His Ser Ser Lys Ala 
Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Arg Ser Lys 
Tyr Asp Val Val Lys Gin Leu Pro He Ala Val Het Gly Thr Ser 
Cys Val Lys Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu He 
Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Arg Ala Leu 
Tyr Asp Val Val Lys Gin Leu Pro He Ala Val Het Gly Thr Ser 
Tyr Gly Phe Gin Tyr Ser Pro Ala Gin Arg Val Asp Phe Leu Leu 
Asn Ala Trp Lys Ser Lys Lys Asn Pro Het Gly Phe Ser Tyr Asp 
Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ala Asp He Arg Thr 
Glu Glu Asp Leu Tyr Gin Ser Cys Asp Leu Val Pro Glu Ala Arg 
Ala Ala He Arg Ser Leu Thr Glu Arg Leu Tyr He Gly Gly Pro 
Leu Thr Asn Ser Lys Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg 
Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr He Thr Cys 
Tyr Leu Lys Ala Ser Ala Ala Cys Arg Ala Ala Lys Leu Arg Asp 
Cys Thr Het Leu Val Cys Gly Asp Asp Leu Val Val He Cys Glu 
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63 64 

Ser Ala Gly Val Gin Glu Asp Aia Ala Asa Leu Arg Ala Phe Thr 

6lu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin 

Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val 

ser Val Ala His Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr 

Arg Asp Pro Glu Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala 

Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn Ne He Het Phe 

Ala Pro Thr Leu Trp Val Arg Met Val Leu Met Thr His Phe Phe 

Ser He Leu He Ala Gin Glu His Leu Glu Lys Ala Leu Asp Cys 

Glu He Tyr Gly Ala Val His Ser Val Gin Pro Leu Asp Leu Pro 

Glu He He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His 

Ser Tyr Ser Pro Gfy Glu He Asn Arg Val Afa Ala Cys Leu Arg 

Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala Arg 

Ser Val Arg Ala Thr Leu Leu Ser Gin Gly Gly Arg Ala Ala He 

s Gly Lys Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys 

Leu Thr Pro Leu Pro Ser Ala Ser Gin Leu Asp Leu Ser Asn Trp 

Phe Thr Gly Gly Tyr Ser Gly Gly Asd He Tyr His Ser Val Ser 
His Val Arg Pro Arg Trp Phe Phe Trp Cys Leu Leu Leu Leu Ser 

Val Gly Val Gly He Tyr Leu Leu Pro Asn Arg 

[0047] 
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65 

titm^: 16 
E$>i<SO£$ : 5 0 2 

Mlflomm. gsif (YSnr-5' aiino acid) 

HetSerThrAsnPro LysProGlnArgLys ThrLysArgAsnThr AsnApgArgProGln 
AspValLysPhePro GlyGlyGfyGlnlle ValGlyGlyValTyr UttLeuPpoArgArg 
GlyProApgValGly ValArgAlaThrArg LysThrSerGluArg SerGlnProArgGly 
ArgApgGlnProHe ProLysAlaArgArg ProGluGlyArgSer TppAlaGlnPpoGly 
TyrProTrpProleu TyrGlyAsnGluGly CysGlyTrpAlaGly TrpleuLeuSerPro 
ArgGlySerArgPro SerTrpGlyProThr AspProArgArgArg SerApgAsnLeuGly 
LysVallleAspThp LeuThrCysGlyPhe AlaAspLeuHetGiy TyrlleProLeuVal 
GlyAIaPPOleuGly GlyAlaAlaArgAla LeuAlaHisGlyVal ArgValLeuGluAsp 
GlyValAsnTypAla ThpGlyAsnLeuPro GlyCysSerPheSer IlePheLeuleuAla 
LeuLeuSepCysLeu ThpValPpoAlaSer AlaValGluValArg AsnSopSerGlyVal 
TypHisValThPAsn AspCysProAsnAIa SepValValTyrGfu ThrGfuserLeuI le 
HetHisLeuProGly CysValProCysVal ArgGluGlyAsnAIa SerArgCysTppVal 
SerLeuSepppoThr I leAlaAlaLysAsp ProSerValProVal SerGlulleArgArg 
HisValAspLeuIle ValGlyAlaAJaAla PheCysSerAlaHet TyplfalGfyAspLeu 
CysGlySerllePhe LeuValGlyGlnfle PheThpPheSerPro ArgArgHisTrpThp 
ThrGlnAspcysAsn CysSeplleTyppro GlyHisValThrGly HisArgMetAlaTrp 
AspHetMetftetAsn TrpSerProThrGly AlaLeuValHetAla GlnteuLeuArglle 
ProGlnAlaValVal AspHetl leAJaGly AlaHisTrpGlyVal LeuAlaGlyleuAfa 
TyrTyrSerftetVal GJyAsnfrpAlalys ValValValValLeu LeuLeuPheAlaGly 
ValAspAlaAspThr GlnValThrGlyGly SerAlaAfaTyrAsp AlaArgGiyieuAla 
SerLeuPheThrPro GlyppoLysGlnAsn IleGlnLeuIleAsn ThrAsnGlySepTrp 
HisIleAsnArgThr AlaLeuAsnCysAsn GluSepLeuAsnThr GlyTrpValAlaGly 
Lei/PheTyrTyrHis LysPheAsnSerSer GlyCysProGfuArg HetAlaSerCysGln 
ProLeuThrAlaPte AspGlnGiyTppGly PpoI leThrTyrGlu GlyAsnAlaSerGly 
AspGlnApgProTyr CysTrpHisTypAla ProArgProCysGly I feValProAJaArg 

GluVal [0 0 4 8] 
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67 

mm^ ■ 1 7 

fi*J*>£S : 3 6 5 

E*I<oa3K : fiBSf CYS117-3* amino acid) 

ThrValThrGluAla AsplleArgThrGlu GluAspLeuTyrGIn SerCysAspleuVal 
ProGluAlaArgThr AlalleArgSerleu ThrGluArgLeuTyr ileGlyGlyProLeu 
ThrAsnSerLysGly GlnAsnCysGlyTyp ArgArgCysArgAla SerGlyVal LeuThr 
ThrSerCysGlyAsn ThrlleThrCysTyr LeuLysAlaSerAla AUCysArgAiaAla 
LysLeuGlnAspCys ThrHetLeuValCys GlyAspAspLeuVal Val IleCysGluSer 
AlaGlyValGlnGIu AspAlaAiaSerleu ArgAlaPheThrGlu AlaHetThrArgTyr 
SerAlaProProGly Asp ProP roG I nP ro GluTyrAspLeuGlu LeuIleThrSerCys 
SerSerAsnValSsr ValAlaHisAspGly SerGlylysArgVal TyrTyrLeuThrArg 
AspProGluThrPro LeuAlaArgAlaAla TrpGluThrAlaArg HisThrProValAsn 
SerTrpLeuGlyAsn IlelleftetPheAia ProThrLeuTrpVal ArgHetValLeuHet 
ThrHisPhePheSer IleLeuI leAlaGIn GluH i sLeuGluLys AlaLeuAspCysGlu 
IleTyrGlyAiaVal H isSerValGI nPro LeuAspLeuProGlu HelleGlnArgleu 
HisGlyLeuSerAla PheSerLeuHisSer TyrSerProGlyGlu lleAsnArgValAla 
AlaCysLeuArgLys LeuGIyValProPro LeuArgAlaTrpArg HisArgAlaArgSec 
ValArgAlaThrLeu LeuSerGlnGlyGIy LysAlaAlalleCys GlyLysTyrLeuPhe 
AsnTrpAlaValLys ThrLysLeuLysLeu ThrProLeuProSer AlaSerGlnLeuAsp 
LeuSerAsnTrpPhe ThrGlyGlyTyrSer GlyGlyAspIleTyr HisSerValSerHis 
ValArgProArgTrp PhePheTrpCysLeu LeuLeuLeuSerVa! GlyValGlyNeTyr 
LeuLeuProAsnArg 

[0 049] 
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68 
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69 70 

E#fa£3 : 5 0 2 

WMnmm. : (SR037-5" aaino acid) 



HetSerlhrAsnPro LysProGlnArgLys ThrLysArgAsnThr AsnArgArgProGIn 
AspValLysPhePro GlyGlyGlyGlnlle VafGlyGlyVafTyr LeuLeuProArgArg 
GlyProArgHetGly ValArgAlaThrArg LysThrSerGluArg SerGlnProArgGly 
ArgArgGlnProIle ProLysAlaArgArg ProGluGlyArgSer TrpAlaGlnProGly 
TyrProTrpProLeu TyrGlyAsnGluGly CysGlyTrpAlaGly TrpLeuLeuSerPro 
ArgGlySerArgPro SerTrpGlyProSer AspProArgArgArg SerArgAsnLeuGly 
LysVallleAspThr LeuThrCysGlyPhe AfaAspLeuHecGly TyrlleProLeuVal 
GlyAlaProleuGly GJyAlaAlaArgAla LeuAlaHisClyVal ArgVaiLeuGluAap 
GlyvalAsnTyrAla ThrGlyAsnLeuPro GlyCysSerPheSer IlePheleuLeuAla 
LeuLeuSerCysleu ThrValProAlaSer AlaValGlyValArg AsnSsrSerGlyVal 
TyrHisValTfirAsn AspCysProAsnAla SerValValTyrGlu ThrGluSerleulle 
IleHisleuProGly CysValProCysVal ArgGluGlyAsnAla SerArgCysTrpVal 
SerLeuSerProThr ValAlaAlalysAsp ProSerValProVal SerGlulleArgArg 
HisValAspLeuIle VafGlyAlaAlaAla PheCysSerAlaMet TyrValGlyAspLeu 
CysGlySerNePhe LeuValGlyGlnlle PheThrLeuSerPro ArgArgHisTrpThr 
ThrGlnAspCysAsn CysSerlleTyrPro GlyHisValThrGly HisArgHetAlaTrp 
AspHetHetHecAsn TrpserProThrGly AlaLeuValVafAla GlnLeu Leu Argils 
ProGlnAlaValVal AspMetl leAlaG I y AlaHisTrpGlyVa! LeuAlaGlyLeuAla 
TyrTyrSerHetVaf GlyAsnTrpAlaLys ValValValValLeu LeuLeuPheAlaGly 
ValAspAfaGluThr GlnValSerGlyGly SerAlaAlaGlnThr ThrTyrGlyleuThr 
AfaLeuPheArgrhr GlyProAsnGlnLys MeGlnLeuIleAsn ThrAsnGlySerTrp 
HisHeAsrtArgThr AlaLeuAsnCysAsn GluSerleuHisThr GlyTrpLeuAlaAla 
LeuPheTyrThrHis LysPheAsnSerSer GlyCysLeuGluArg HetAlaSerCysGIn 
ProLeuSerAlaPhe AspGlnGlyTrpGly ProIleThrTyrGly GlyAsnAlaSerAsp 
GlyGlnArgProTyr CysTrpHisTyrAla ProArgProCvsGly IleValProAlaArg 

GluVal [0 0 5 0] 
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71 72 1 

E#ftf>£$ :365 

EM^Whfieff (SR037-3' amino acid) 

ThrValThrGluAla AspIleArgThrGIu GluAspLeuTyrGln SerCysAspLeuVal 
ProGluAlaArgThr AlalleArgSerLeu ThrGluArgLeuTyr HeGlyGlyProLeu 
ThrAsnSerLysGly GlnAsnCysGlyTyr ArgArgCysArgAla SerGlyVal LeuThr 
ThrSerCysGlyAsn ThrlleThrCysTyr LeuLysAlaSerAla AlaCysArgAlaAla 
LysLeuArgAspCys ThrHetLeuValCys GlyAspAspLeuVal VallleCysGluSer 
AlaGlyValGlnGlu AspAlaAlaSerLeu ArgAlaPheThrGlu AlaHetThrArgTyr 
SerAlaProProGly AspProProGlnPro GfuTyrAspLeuGlu LeuIIeThrSerCys 
SerSerAsiWalSer ValAlaHisAspGly AiaGlyLysArgVal TyrTyrLeuThrArg 
AspProGluThrPro LeuAlaArgAlaAla TrpGluThrAlaArg HisThrProValAsn 
SerTrpleuGlyAsn Ilet JeMetPHeAla ProThrLeuTrpVa! ArgMetValLeuHet 
TftrHisPhePheSer IleLeuIleAJaGIn GluHisLeuGluLys AlaLeuAspCysGiu 
UeTyrGlyAlaVal HisSerMeGlnPro LeuAspleuProGlu IlelleG/nArgLeu 
HisGiyLeuSerAla PlieSerLeuHisSer TyrSerProGlyGlu IleAsnArgValAla 
AlaCysLeuArgLys LeuGlyValProPro LetiArgAlaTrpAnj HisArgAlaArgSer 
ValArgAlaThrLeu LeuSerGlnGlyGly LysAlaAlalleCys GlyLysryrLeuPhe 
AsnTrpAlaValLys ThrlysleuLysLeu ThrProleuProSer AlaSerGlnLeuAsp 
LeuserAsnTrpPhe ThrGlyGlyTyrAsn GlyGlyAspIleTyr HisSerValSerArg 
ValArgProArgTrp PhePheTrpCysLeu LeuLeuLeuSerVa! GlyvalGlylleTyr 
LeuLeuProAsnArg 

mil 
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